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Abstract 

We give the first polynomial-time approximation scheme (PTAS) for the Steiner forest prob- 
lem on planar graphs and, more generally, on graphs of bounded genus. As a first step, we show 
how to build a Steiner forest spanner for such graphs. The crux of the process is a clustering 
procedure called prize- collecting clustering that breaks down the input instance into separate 
subinstances which are easier to handle; moreover, the terminals in different subinstances are 
far from each other. Each subinstancc has a relatively inexpensive Steiner tree connecting all 
its terminals, and the subinstances can be solved (almost) separately. Another building block 
is a PTAS for Steiner forest on graphs of bounded treewidth. Surprisingly, Steiner forest is 
NP-hard even on graphs of treewidth 3. Therefore, our PTAS for bounded treewidth graph 
needs a nontrivial combination of approximation arguments and dynamic programming on the 
tree decomposition. We further show that Steiner forest can be solved in polynomial time for 
series-parallel graphs (graphs of treewidth at most two) by a novel combination of dynamic 
programming and minimum cut computations, completing our thorough complexity study of 
Steiner forest in the range of bounded treewidth graphs, planar graphs, and bounded genus 
graphs. 

1 Introduction 

One of the most fundamental problems in combinatorial optimization and network design with both 
practical and theoretical significance is the Steiner forest problem, in which given a graph G = (V, E) 
and a set consisting of pairs of terminals, called demands, V = {(si,ii), (52,^2); • • • > (s/bifc)}' the 
goal is to find a minimum-cost forest F of G such that every pair of terminals in T> is connected 
by a path in F. The first and the best approximation factor for this problem is 2 due to Agrawal, 
Klein and Ravi [AKR95] (see also Goemans and Williamson [GW95J). The conference version 
of the Agrawal, Klein and Ravi [AKR91] appeared in 1991, and there have been no improved 
approximation algorithms invented for Steiner forest. Recently Borradaile, Klein and Kenyon- 
Mathieu [BKM08J obtain a Polynomial Time Approximation Scheme (PTAS) for Euclidean Steiner 
forest where the terminals are in the Euclidean plane. They pose obtaining a PTAS for Steiner 
forest in planar graphs, the natural generalization of Euclidean Steiner forest, as the main open 
problem. We note that in network design, planarity is a natural restriction, since in practical 
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scenarios of physical networking, with cable or fiber embedded in the ground, crossings are rare 
or nonexistent. In this paper, we settle this open problem by obtaining a PTAS for planar graphs 
(and more generally, for bounded genus graphs) via a novel technique of prize- collecting clustering 
with potential use to obtain other PTASs in planar graphs. 

The special case of the Steiner forest problem when all pairs have a common terminal is the 
classical Steiner tree problem, one of the first problems shown NP-hard by Karp [Kar75j . The 
problem remains hard even on planar graphs [G J77] . In contrast to Steiner forest, a long sequence 
of papers give approximation factors better than 2 for this problem [Zel92l [Zel93, BR92J IZel96[ 
PS97] [KZ971 IHP991 IRZ05] ; the current best approximation ratio is 1.55 [RZ05]. Since the problem 
is APX-hard in general graphs [BP89 . [Thi03] . we do not expect to obtain a PTAS for this problem in 
general graphs. However, for the Euclidean Steiner tree problem, Arora |Aro98j and Mitchell (\ I it 95 
present a PTAS. Obtaining a PTAS for Steiner tree on planar graphs, the natural generalization of 
Euclidean Steiner tree, remained a major open problem since the conference version of Arora [Aro96j 
in 1996. Only in 2007, Borradaile, Kenyon-Mathieu, and Klein [BKMK07J settle this problem with 
a nice technique of constructing light spanners for Seiner trees in planar graphs. In this paper, we 
also generalize this result to obtain light spanners for Steiner forests. 

Most approximation schemes for planar graph problems use (implicitly or explicitly) the fact 
that the problem is easy to solve on bounded-treewidth graphs. In particular, a keystone blackbox 
in the algorithm of Borradaile et al. [BKMK07] for Steiner tree is the result that, for every fixed 
value of k, the problem is polynomial-time solvable on graphs having treewidth at most k. There 
is a vast literature on algorithms for bounded-treewidth graphs and in most cases polynomial-time 
(or even linear-time) solvability follows from the well-understood standard technique of dynamic 
programming on tree decompositions. However, for Steiner forest, the obvious way of using dy- 
namic programming does not give a polynomial-time algorithm. The difficulty is that, unlike in 
Steiner tree, a solution of Steiner forest induces a partition on the set of terminals and a dynamic 
programming algorithm needs to keep track of exponentially many such partitions. In fact, this 
approach seems to fail even for series-parallel graphs (which have treewidth at most 2); the com- 
plexity of the problem for series-parallel graphs was stated as an open question by Richey and 
Parker [RP86J in 1986. We resolve this question by giving a polynomial-time algorithm for Steiner 
forest on series-parallel graphs. The main idea is that even though algorithms based on dynamic 
programming have to evaluate subproblems corresponding to exponentially many partitions, the 
function describing these exponentially many values turn out to be submodular and it can be rep- 
resented in a compact way by the cut function of a directed graph. On the other hand, Steiner 
forest becomes NP-hard on graphs of treewidth at most 3. Thus perhaps this is the first example 
when the complexity of a natural problem changes as treewidth increases from 2 to 3. In light 
of this hardness result, we investigate the approximability of the problem on bounded-treewidth 
graphs and show that, for every fixed k, Steiner forest admits a PTAS on graphs of treewidth at 
most k. The main idea of the PTAS is that if the dynamic programming algorithm considers only 
an appropriately constructed polynomial-size subset of the set of all partitions, then this produces 
a solution close to the optimum. Very roughly, the partitions in this subset are constructed by 
choosing a set of center points and classifying the terminals according to the distance to the center 
points. Our PTAS for planar graphs (and more generally, for bounded genus graphs) uses this 
PTAS for bounded-treewidth graphs. This completes our thorough study of Steiner forest in the 
range of bounded treewidth graphs, planar graphs and bounded genus graphs. 
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1.1 Our results and techniques 

Our main result in this paper is a PTAS for the planar Steiner forest problem. 

Theorem 1. For any constant e > 0, there is a polynomial-time (1 + ^-approximation algorithm for 
the Steiner forest problem on planar graphs and, more generally, on graphs of bounded genus. 

To this end, we build a Steiner forest spanner for the input graph and the set of demands in 
Section |4) Roughly speaking, a Steiner forest spanner is a subgraph of the given graph whose cost 
is no more than a constant factor times the cost of the optimal Steiner forest, and furthermore, it 
contains a nearly optimal Steiner forest. Denote by OPTx>(G) the minimum cost of a Steiner forest 
of G satisfying (connecting) all the demands in T>. We sometimes use OPT instead of OPTx>(G). 
A subgraph H of G is a Steiner forest spanner with respect to demand set T> if it has the following 
two properties: 

Spanning Property: There is a forest in H that connects all demands in T> and has length at 
most (1 + e)OPT v (G), namely, OPT v (H) < (1 + e)OPT v (G). 

Shortness Property: The total length of H is at most f(e)OPTx>(G). 

Theorem 2. Given any e > 0, a bounded genus graph Gi n {Vi n ,Ei n ) and demand pairs V, we can 
compute in polynomial time a Steiner forest spanner H for Gi n with respect to demand set V. 

The algorithm that we propose achieves this in time 0(n 2 logn). The entire algorithm for 
Steiner forest runs in polynomial time but the exponent of the polynomial depends on e and the 
genus of the input graph. 

The proof of Theorem [2] heavily relies on a novel clustering method presented in Theorem [3] 
that allows us to (almost) separately build the spanners for smaller and far apart sets of demands. 
Construction of the spanner for each of the sets itself uses ideas of Borradaile et al. [BKMK07] . 
although there are still several technical differences. The clustering technique works for general 
graphs as opposed to the rest of the construction which requires the graphs to have bounded genus. 

Theorem 3. Given an e > 0, a graph Gi n (Vi n: Ei n ), and a set V of pairs of vertices, we can compute 
in polynomial time a set of trees {Ti, . . . , T^}, and a partition of demands {T>\, . . . , T>k}, with the 
following properties. 

1. All the demands are covered, i.e., V = U;=i 

2. All the terminals in T>i are spanned by the tree Tj. 

3. The sum of the costs of all the trees Tj is no more than (- + 2)OPT. 

4. The sum of the costs of minimum Steiner forests of all demand sets T>i is no more than 1 + e 
times the cost of a minimum Steiner forest of Gi n ; i.e., £\ OPTx^Gm) < (1 + e)OPTx>(Gi n ). 

The last condition implies that (up to a small factor) it is possible to solve the demands T>i 
separately. Notice that this may lead to paying for poritions of the solution more than once. 

We will prove Theorem [3] in Section [3j Roughly speaking, the algorithm here first recognizes 
some connected components by running a 2-approximation Steiner forest algorithm. Clearly, this 
construction satisfies all but the last condition of the theorem. However, at this point, these 
connected components might not be sufficiently far from each other such that we can consider 
them separately. To fix this, we contract each connected component into a "super vertex" to 



3 



which we assign a prize (potential) which is proportional to the sum of the edge weights of the 
corresponding component. Now we are running an algorithm that we call prize- collecting clustering. 
The algorithm as well as some parts of its analysis bears similarities to a primal-dual method due 
to Agrawal, Klein and Ravi |AKR9l| and Goemans and Williamson [GW95]. Indeed our analysis 
strengthens these previous approaches by proving some local guarantees (instead of the global 
guarantee provided in these algorithms). In some sense, this clustering algorithm can also be 
seen as a generalization of an implicit clustering algorithm of Archer, Bateni, Hajiaghayi, and 
Karloff [ABHK09J who improve the best approximation factor for prize-collecting Steiner tree to 
2 — e, for some constant e > 0. In this clustering, we consider a topological structure of the graph 
in which each edge is a curve connecting its endpoints whose length is equal to its weight. We color 
(portions of) edges by different colors each corresponding to a super vertex. These colors form a 
laminar family and the "depth" of each color is at most the prize given to its corresponding super 
vertex. Using this coloring scheme we further connect some of these super vertices to each other 
with a cost proportional to the sum of their prizes. At the end, we show that now we can consider 
these combined connected components as separate clusters and, roughly speaking, an optimum 
solution need not connect two different clusters because of the concept of depth. We believe the 
prize-collecting clustering presented in this paper might have applications for other problems (esp. 
to obtain PTASs). 

To obtain a PTAS promised in Theorem[TJ first we construct a spanner Steiner forest provided in 
Theorem [2l On this spanner, we utilize a technique due to Klein [Kle08], and Demaine, Hajiaghayi 
and Mohar [DHM07] which reduces the problem of obtaining a PTAS in a planar (and more 
generally, bounded genus) graph whose total edge weight is within a constant factor of the optimum 
solution to that of finding an optimal solution in a graph of bounded treewidth (see the proof of 
Theorem[T]in Section[5]for more details.) However, there are no known polynomial-time algorithms 
in the literature for Steiner forest on bounded-treewidth graphs, so we cannot plug in such an 
algorithm to complete our PTAS. Therefore, we need to investigate Steiner forest on bounded- 
treewidth graphs. 

Resolving an open question of Richey and Parker [RP86] from 1986, we design a polynomial- 
time algorithm for Steiner forest on series-parallel graphs. Very recently, using completely dif- 
ferent techniques, a polynomial-time algorithm was presented for the special case of outerplanar 
graphs [Gas09] . 

Theorem 4. The Steiner forest problem can be solved in polynomial time for subgraphs of series- 
parallel graphs (i.e., graphs of treewidth at most 2). 

A series-parallel graph can be built form elementary blocks using two operations: parallel 
connection and series connection. The algorithm of Theorem |4] uses dynamic programming on the 
construction of the series-parallel graph. For each subgraph arising in the construction, we find a 
minimum weight forest that connects some of the terminal pairs, connects a subset of the terminals 
to the "left exit point" of the subgraph, and connects the remaining terminals to the "right exit 
point" of the subgraph. The minimum weight depends on the subset of terminals connected to 
the left exit point, thus it seems that we need to determine exponential many values (one for each 
subset). Fortunately, it turns out that the minimum weight is a submodular function of the subset. 
Furthermore, we show that this function can be represented by the cut function of a directed 
graph and this directed graph can be easily constructed if the directed graphs corresponding to 
the building blocks of the series-parallel subgraph are available. Thus, following the construction 
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of the series-parallel graph, we can build all these directed graphs and determine the value of the 
optimum solution by the computation of a minimum cut. 

Surprisingly, it turns out that the problem becomes NP-hard on graphs of treewidth at most 
3 |Gas09] . For completeness, in Section [8] we give a (different) NP-hardness proof, which highlights 
how submodularity (and hence the approach of Theorem [3]) breaks if treewidth is 3. There exist a 
few known problems that are polynomial-time solvable for trees but NP-hard for graphs of treewidth 
2 [ZN98, Mar05, Mar09, RP86j, but to our knowledge, this is the first natural example where there 
is a complexity difference between treewidth 2 and 3. 

Not being able to solve Steiner forest optimally on bounded-treewidth graphs is not an un- 
avoidable obstacle for obtaining a PTAS on planar graphs: the technique of Klein, and Demaine, 
Hajiaghayi and Mohar also can be applied when we have a PTAS for graphs of bounded treewidth. 
In Section [6j we demonstrate such a PTAS: 

Theorem 5. For every fixed w > 1 and e > 0, there is a polynomial-time (1 + e)-approximation 
algorithm for Steiner Forest on graphs with treewidth at most w. 

Note that the exponent of the polynomial in Theorem [5] depends on both e and w; it remains 
an interesting question for future research whether this dependence can be removed. 

The main idea of the PTAS of Theorem [5] is to reduce the set of partitions considered in the 
dynamic programming algorithm to a polynomially bounded subset, in a way that an (1 + e)- 
approximate solution using only these partitions is guaranteed to exist. The implementation of 
this idea consists of three components. First, we have to define which partitions belong to the 
polynomially bounded subset. These partitions are defined by choosing a bounded number of 
center points and a radius for each center. A terminal is classified into a class of the partition 
based which center points cover it. Second, we need an algorithm that finds the best solution 
using only the allowed subset of partitions. This can be done following the standard dynamic 
programming paradigm, but the technical details are somewhat tedious. Third, we have to argue 
that there is a (1 + e)-approximate solution using only the allowed partitions. We show this by 
proving that if there is a solution that uses partitions that are not allowed, then it can be modified, 
incurring only a small increase in the cost, such that it uses only allowed partitions. The main 
argument here is that for each partition appearing in the solution, we try to select suitable center 
points. If these center points do not generate the required partition, then this means that a terminal 
is misclassified, which is only possible if the terminal is close to a center point. In this case, we 
observe that two components of the solution are close to each other and we can join them with 
only a small increase of the cost. The crucial point of the proof is a delicate charging argument, 
making use of the structure of bounded-treewidth graphs, which shows that repeated applications 
of this step results in a total increase that is not too large. 

2 Basic definitions 

Let G{V, E) be a graph. As is customary, let 5(V) denote the set of edges having one endpoint 
in a subset V' C V of vertices. For a subset of vertices V C V ', the subgraph of G induced by V 
is denoted by G[V']. With slight abuse of notation, we sometimes use the edge set to refer to the 
graph itself. Hence, the above-mentioned subgraph may also be referred to by £J[V'] for simplicity. 
We denote the length of a shortest x-to-y path in G as distG(x, y). For an edge set E, we denote 
by £(E) := YleeE c e the total length of edges in E. 
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A collection S is said to be laminar if and only if for any two sets Ci, Ci £ S, we have C\ C C2, 
C2 Q Ci, or Ci n C2 = 0. Suppose C is a partition of a ground set V. Then, C(v) denotes for each 
v £ V the set CgC that contains t> . 

Given an edge e = (u, v) in a graph G, the contraction of e in G denoted by G/e is the result of 
unifying vertices v and w in G, and removing all loops and multiple edges except the shortest edge. 
More formally, the contracted graph G/e is formed by the replacement of u and v with a single 
vertex such that edges incident to the new vertex are the edges other than e that were incident 
with u or v. To obtain a simple graph, we first remove all self-loops in the resulting graph. In case 
of multiple edges, we only keep the shortest edge and remove all the rest. The contraction G/E' is 
defined as the result of iteratively contracting all the edges of E' in G, i.e., G/E' := G/ex/ei/ . . . /ejt 
if E' = {ei,e2, • • • , e&}. Clearly, the planarity of G is preserved after the contraction. Similarly, 
contracting edges does not increase the cost of an optimal Steiner forest. 

The boundary of a face of a planar embedded graph is the set of edges adjacent to the face; it 
does not always form a simple cycle. The boundary dH of a planar embedded graph H is the set 
of edges bounding the infinite face. An edge is strictly enclosed by the boundary of H if the edge 
belongs to H but not to dH. 

Now we define the basic notion of treewidth, as introduced by Robertson and Seymour [RS86J. 
To define this notion, we consider representing a graph by a tree structure, called a tree decompo- 
sition. More precisely, a tree decomposition of a graph G(V, E) is a pair (T, B) in which T(I, F) is 
a tree and B = {Bi \ i £ 1} is a family of subsets of V(G) such that 1) \J ieI Bi = V; 2) for each 
edge e = (it, v) £ E, there exists an i £ / such that both u and v belong to Bf, and 3) for every 
v £ V , the set of nodes {i £ 1 1 v £ Bi} forms a connected subtree of T. 

To distinguish between vertices of the original graph G and vertices of T in the tree decom- 
position, we call vertices of T nodes and their corresponding B^s bags. The width of the tree 
decomposition is the maximum size of a bag in B minus 1. The treewidth of a graph G, denoted 
tw(G), is the minimum width over all possible tree decompositions of G. 

For algorithmic purposes, it is convenient to define a restricted form of tree decomposition. We 
say that a tree decomposition (T, B) is nice if the tree T is a rooted tree such that for every % £ I 
either 

1 . i has no children (i is a leaf node) , 

2. i has exactly two children i±, ii and Bi = B^ = Bi 2 holds (i is a join node), 

3. i has a single child i' and Bi = B^ U {v} for some v £ V (i is an introduce node), or 

4. i has a single child i' and Bi = B^ \ {v} for some v £ V (i is a forget node). 

It is well-known that every tree decomposition can be transformed into a nice tree decomposition 
of the same width in polynomial time. Furthermore, we can assume that the root bag contains 
only a single vertex. 

We will use the following lemma to obtain a nice tree decomposition with some further properties 
(a related trick was used in [Mar07], the proof is similar): 

Lemma 6. Let G be a graph having no adjacent degree 1 vertices. G has a nice tree decomposition 
of polynomial size having the following two additional properties: 

1. No introduce node introduces a degree 1 vertex. 

2. The vertices in a join node have degree greater than 1. 
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Proof. Consider a nice tree decomposition of graph G. First, if v is a vertex of degree 1, then we 
can assume that v appears only in one bag: if w is the unique neighbor of v, then it is sufficient 
that v appears in any one of the bags that contain w. Let B v = {v, x\, . . . , xt} be this bag where 
x\ = w. We modify the tree decomposition as follows. We replace B v with B' v = B v \ {v}, insert 
a bag B'l = B v \ {v} between B' v and its parent, and create a new bag B l = B v \ {v} that is the 
other child of B" (thus B'l is a join node). For i = 1, . . . , t — 1, let B l = {x\, . . . , Xj}, and let B % be 
the child of B i+1 . Finally, let B w = {w,v} be the child of B 1 and let B = {v} be the child of B w . 
Observe that B l (2 < i < t), B w are introduce nodes, B is a forget node, and B is a leaf node. 
This operation ensures that vertex v appears only in a leaf node. It is clear that after repeating 
this operation for every vertex of degree 1, the two required properties will hold. □ 

We also need a basic notion of embedding; see, e.g., [RS94{ [CM05], In this paper, an embedding 
refers to a 2-cell embedding, i.e., a drawing of the vertices and edges of the graph as points and 
arcs in a surface such that every face (connected component obtained after removing edges and 
vertices of the embedded graph) is homeomorphic to an open disk. We use basic terminology and 
notions about embeddings as introduced in [MTOl]. We only consider compact surfaces without 
boundary. Occasionally, we refer to embeddings in the plane, when we actually mean embeddings 
in the 2-sphere. If S is a surface, then for a graph G that is (2-cell) embedded in S with / facial 
walks, the number g = 2 — |V(G)| + \E(G)\ — f is independent of G and is called the Euler genus 
of S. The Euler genus coincides with the crosscap number if S is non-orientable, and equals twice 
the usual genus if the surface S is orientable. 



3 Prize-collecting clustering 

In this section, we describe an algorithm PC-Clustering that is used to prove Theorem [3j The 
algorithm as well as its analysis bears similarities to the primal-dual method due to Agrawal, Klein 
and Ravi [AKR95| and Goemans and Williamson [GW95]. It uses a technique that we call prize- 
collecting clustering and our analysis strengthens the previous approaches by proving some local 
guarantees (instead of the global guarantee provided in previous algorithms). We build a forest F2 
each of whose components correspond to one sought in Theorem [3j Along the way, we also come 
up with a vector y satisfying the sets of constraints (pQ)-([3]) below. During the process, we maintain 
a vector y satisfying all these constraints, and at the end, it will be true that all the constraints (|2|) 
hold with equality. The analysis takes advantage of the connection between F2 and y. 

We start with a 2-approximat^] solution F* satisfying all the demands in T>; the cost of F* is 
at most 20PT. The forest F* consists of tree components T*. In the following, we connect some 
of these components to make the trees T{. It is easy to see this construction guarantees the first 
two conditions of Theorem [3j We work on a graph G(V, E) formed from Gi n by contracting each 
tree component of F*. A potential <j> v is associated with each vertex v of G, which is = times the 
cost of the tree component corresponding to v in case v is the contraction of a tree component, and 
zero otherwise. 

1 Such a solution can be found via Goemans- Williamson's Steiner forest algorithm, for instance. 
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These constraints are very similar to the dual LP for the prize- collecting Steiner tree problem 
when <j> v are thought of as penalty values corresponding to the vertices. In the standard linear 
program for the prize-collecting Steiner tree problem, there is a special root vertex to which all the 
terminals are to be connected. Then, no set containing the root appears in the formulation. 

The solution is built up in two stages. First we perform an unrooted growth to find a forest 
F\ and a corresponding y vector. In the second stage, we prune some of the edges of F\ to get 
another forest F%. Uncontracting the trees T* turns F% into the Steiner trees T, in the statement 
of Theorem [3l Below we describe the two phases of Algorithm Q] (PC-Clustering). 

Growth We begin with a zero vector y, and an empty set F\ . We maintain a partition C of vertices 
V into clusters; it initially consists of singleton sets. Each cluster is either active or inactive] the 
cluster C G C is active if and only if Y^ccc^2veC' VC,v < YlveC&v- A vertex v is live if and 
only if ^2c 5v yc,v < <t>v Equivalently, a cluster C G C is active if and only if there is a live vertex 
v G C. We simultaneously grow all the active clusters by r]. In particular, if there are n(C) > live 
vertices in an active cluster C, we increase yc, v by T)/k{C) for each live vertex v G C. Hence, yc 
defined as YlveG VG,v is increased by rj for an active cluster C. We pick the largest value for rj that 
does not violate any of the constraints in ([1]) or (|2|) . Obviously, r/ is finite in each iteration because 
the values of these variables cannot be larger than Y2 V 4>v- Hence, at least one such constraint goes 
tight after the growth step. If this happens for an edge constraint for e = (u,v), then there are 
two clusters C u 3 u and C v 3 v in C; at least one of the two is growing. We merge the two clusters 
into C = C u U C v by adding the edge e to F\, remove the old clusters and add the new one to C. 
Nothing needs to be done if a constraint ([2]) becomes tight. The number of iterations is at most 
2\V\, because at each event either a vertex dies, or the size of C decreases. 

We can think of the growth stage as a process that colors portions of the edges of the graph. 
This gives a better intuition to the algorithm, and makes several of the following lemmas intuitively 
simple. Consider a topological structure in which vertices of the graph are represented by points, 
and each edge is a curve connecting its endpoints whose length is equal to the weight of the edge. 
Suppose a cluster C is growing by an amount r\. This is distributed among all the live vertices 
v G C, where yc )V is increased by r( := r//re(C). As a result, we color by color v a connected portion 
with length rj of all the edges in 5(C). Finally, each edge e gets exactly Y^c-eeS(C) Vc,v units of 
color v. We can perform a clean-up process, such that all the portions of color v are consecutive on 
an edge! Hence, as a cluster expands, it colors its boundary by the amount of growth. At the time 
when two clusters merge, their colors barely touch each other. At each point in time, the colors 
associated with the vertices of a cluster form a connected region. 

2 We can do without the clean-up if we perform the coloring in a lazy manner. That is, we do not do the actual 
color assignment until the edge goes tight or the algorithm terminates. At this point, we go about putting colors 
on the edges, and we make sure the color corresponding to any pair (S, v) forms a consecutive portition of the edge. 
This property is not needed as part of our algorithm, though, and is merely for the sake of having a nice coloring 
which is of independent interest. 
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Pruning Let S denote the set of all clusters formed during the execution of the growth step. It 
can be easily observed that the clusters S are laminar and the maximal clusters are the clusters of 
C. In addition, notice that i*i[C] is connected for each C E S. 

Let B C S be the set of all such clusters that are tight, namely, for each S E B, we have 
X^S'cs YlvsS' VS',v = 'Ylv&s in the pruning stage, we iteratively remove some edges from Fx to 
obtain F 2 . More specifically, we first initialize F 2 with F\. Then, as long as there is a cluster S E B 
such that F2 n 5(S) = {e}, we remove the edge e from F2. 

A cluster C is called a pruned cluster if it is pruned in the second stage in which case, 5(C)nF 2 = 
0. Hence, a pruned cluster cannot have non-empty and proper intersection with a connected 
component of F 2 . 

Algorithm 1 PC-Clustering 

Input: planar graph Gi n (Vi n , Ei n ), and set of demands T>. 
Output: set of trees T, with associated T>i. 

1: Use the algorithm of Goemans and Williamson [GW95] to find a 2-approximate Steiner forest 

F* of T>, consisting of tree components T£, . . . , T£. 
2: Contract each tree T* to build a new graph G(V, E). 

3: For any v E V, let 4> v be ^ times the cost of the tree T* corresponding to v, and zero if there 

is no such tree. 
4: Let F ± <- 0. 

5: Let ys, v for any v € S C V. 
6: Let S '*- C *- {{v} : v € V}. 
7: while there is a live vertex do 

8: Let rj be the largest possible value such that simultaneously increasing yc by r\ for all active 
clusters C does not violate Constraints (p])-©. 

9: Let y C {v),v *~ Vc(v),v + K{ c{v)) for a11 live vertices v - 

10: if Be G E that is tight and connects two clusters then 

11: Pick one such edge e = (u, v). 

12: Let F x <- F x U {e}. 

13: Let C *-C(u)UC(v). 

14: LetC ^Cu{C}\{C(u),C{v)}. 

15: Let S «- S U {C}. 

16: Let F 2 <— Fx. 

17: Let B be the set of all clusters S E S such that X^es VS,v = S»;g5 

18: while 3S e B such that F 2 n S(S) = {e} for an edge e do 

19: Let F 2 ^- F 2 \ {e}. 

20: Construct F from F2 by uncontracting all the trees T*. 

21: Let F consist of tree components T{. 

22: Output the set of trees {TJ, along with £>; := {(s,t) £ V : s,t E V(Ti)}. 



We first bound the cost of the forest i*2- The following lemma is similar to the analysis of the 
algorithm in [GW95J. However, we do not have a primal LP to give a bound on the dual. Rather, 
the upper bound for the cost is the sum of all the potential values Yl v 0w ^ n addition, we bound the 
cost of a forest F 2 that may have more than one connected component, whereas the prize-collecting 
Steiner tree algorithm of [GW95] finds a connected graph at the end. 
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Lemma 7. The cost of F2 is at most 2^2 ve y ( t >v - 

Proof. The strategy is to prove that the cost of this forest is at most 2 Ylvescv VS,v = 2 ^2 v& y ( t )v - 
The equality follows from Equation (|2|) — it holds with equality at the end of the algorithm. Recall 
that the growth phase has several events corresponding to an edge or set constraint going tight. 
We first break apart y variables by epoch. Let tj be the time at which the j th event point occurs 
in the growth phase (0 = to < t\ < *2 < " * * )> s ° the j' th epoch is the interval of time from tj—\ 
to tj. For each cluster C, let yjj be the amount by which yc ■= YlvecVCjV grew during epoch 
j, which is tj — tj-i if it was active during this epoch, and zero otherwise. Thus, yc = YljVc^- 
Because each edge e of F2 was added at some point by the growth stage when its edge packing 
constraint (pQ) became tight, we can exactly apportion the cost c e amongst the collection of clusters 
{C : e G 5(C)} whose variables "pay for" the edge, and can divide this up further by epoch. In 
other words, c e = J2j J2c-e£S(c) Vc ' ^ e w ^ n now P rove that the total edge cost from F2 that is 
apportioned to epoch j is at most ^YjcVg • ^ n other words, during each epoch, the total rate at 
which edges of F2 are paid for by all active clusters is at most twice the number of active clusters. 
Summing over the epochs yields the desired conclusion. 

We now analyze an arbitrary epoch j. Let Cj denote the set of clusters that existed during 
epoch j. Consider the graph F%, and then collapse each cluster C € Cj into a supernode. Call the 
resulting graph H. Although the nodes of H are identified with clusters in Cj, we will continue to 
refer to them as clusters, in order to to avoid confusion with the nodes of the original graph. Some 
of the clusters are active and some may be inactive. Let us denote the active and inactive clusters 
in Cj by C ac t and C^ead-, respectively. The edges of F2 that are being partially paid for during epoch 
j are exactly those edges of H that are incident to an active cluster, and the total amount of these 
edges that is paid off during epoch j is (tj — ij-i) X^ceC act deg#(C). Since every active cluster 
grows by exactly tj — tj-\ in epoch j, we have YlcVcP — SceC Vc = (A? — tj-i)\Cact\- Thus, it 
suffices to show that YlceCact °- e S-H'(^') — 2|C ac t|. 

First we must make some simple observations about H. Since F2 is a subset of the edges in 
Fi, and each cluster represents a disjoint induced connected subtree of F\, the contraction to H 
introduces no cycles. Thus, H is a forest. All the leaves of H must be alive, because otherwise the 
corresponding cluster C would be in B and hence would have been pruned away. 

With this information about H, it is easy to bound YlceC degjy(C). The total degree in 
H is at most 2(|C arf | + |Q eac j|). Noticing that the degree of dead clusters is at least two, we get 
YjCtCact ^h{C) < 1{\C ac t\ + \Cdead\) - 2\Cdead\ = 2|C oct | as desired. □ 

The following lemma gives a sufficient condition for two vertices that end up in the same 
component of F%. This is a corollary of our pruning rule which has a major difference from other 
pruning rules. Unlike the previous work, we do not prune the entire subgraph; rather, we only 
remove some edges, increasing the number of connected components. 

Lemma 8. Two vertices u and v of V are connected via F2 if there exist sets S,S' both containing 
u, v such that ys jV > and ys', u > 0. 

Proof. The growth stage connects u and v since ys >v > and u,v € S. Consider the path p 
connecting u and v in F\. All the vertices of p are in S and S'. For the sake of reaching a 
contradiction, suppose some edges of p are pruned. Let e be the first edge being pruned on the 
path p. Thus, there must be a cluster C E B cutting e; furthermore, 5(C) dp = {e}, since e is the 
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first edge pruned from p. The laminarity of the clusters S gives C C S,S' , since C contains exactly 
one endpoint of e. If C contains both or no endpoints of p, it cannot cut p at only one edge. Thus, 
C containts exactly one endpoint of p, say v. We then have Sccc = because C is tight. 
However, as C is a proper subset of S, this contradicts with ys )V > 0, proving the supposition is 
false. The case C contains u is symmetric. □ 

Consider a pair (v, S) with ys, v > 0. If subgraph G' of G has an edge that goes through the 
cut (S, S), at least a portion of length yg, v of G' is colored with the color v due to the set S. Thus, 
if G' cuts all the sets S for which yg v > 0, we can charge part of the length of G' to the potential 
of v. Later in Lemma [TT| we are going to use potentials as a lower bound on the optimal solution. 
More formally, we say a graph G'(V,E') exhausts a color u if and only if E' n 5(S) ^ for any 
S '■ ys,u > 0. The proof of the following corollary is omitted here, however, it is implicit in the 
proof of Lemma QT] below. We do not use this corollary explicity. Nevertheless, it gives insight into 
the analysis below. 

Corollary 9. If a subgraph H of G connects two vertices u\,U2 from different components of F2 (which 
are contracted versions of the components in the initial 2-approximate solution), then H exhausts the 
color corresponding to at least one of u\ and U2- 

We can relate the cost of a subgraph to the potential value of the colors it exhausts. 

Lemma 10. Let L be the set of colors exhausted by subgraph G' of G. The cost of G'(V,E') is at 
least J2veL<f>v- 

This is quite intuitive. Recall that the y variables color the edges of the graph. Consider a 
segment on edges corresponding to cluster S with color v. At least one edge of G' passes through 
the cut (S,S). Thus, a portion of the cost of G 1 can be charged to ys,v Hence, the total cost of 
the graph G' is at least as large as the total amount of colors paid for by L. We now provide a 
formal proof. 

Proof. The cost of G'(V, E) is 

eEE' eeE> S:ee5(S) 

= Y,\E'n5(S)\ ys 

s 

> Yl y s 

S:E'n5(S)^0veS 

= Y1 y^ 

v S3v:E>n8(S)^(l) 

^2 Yl y^ 

v&L SBv:E'nS(S)^fD 
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because ys jV = if v G L and E' n 5(S) = 0, 

by a tight version of (|2|) . □ 

Recall that the trees T* of the 2-approximate solution F* are contracted in F2. Construct F 
from F2 by uncontracting all these trees. Let F consist of tree components Tj. It is not difficult 
to verify that F is indeed a forest, but we do not need this condition since we can always remove 
cycles to find a forest. Define T>i := {(s,t) G 2? : s,t G V(Tj)}, and let OPTj denote (the cost of) 
the optimal Steiner forest satisfying the demands Pj. 

Lemma 11. £\ OPT; < (1 + e)OPT. 

Proof. If each tree component of OPT only satisfies demands from a single T>i, we are done. Oth- 
erwise, we build a solution OPT' consisting of forests OPT^ for each T>i, such that OPT^ < 
(1 + e)OPT. Then, OPT; < OPT^ finishes the argument. Instead of explicitly identifying the 
forests OPT^, we construct an inexpensive set of trees T where each T G T is associated with a 
subset Dt C T> of demands, such that 

• T G T connects all the demands in Dt, 

• all the demands are satisfied, namely, UtgT-Ct = P, and 

• for each TgT, there exists a Pj where Dt Q T>i. 

Then, each OPT^ above is merely a collection of several trees in T. We now describe how T is 
constructed. Start with the optimal solution OPT. Contract the trees T* of the 2-approximate 
solution F* to build a graph OPT. Initially, the set Df associated with each tree component T of 
OPT specifies the set of demands connected via T. If the color u corresponding to a tree component 
T* is exhausted by OPT, add T* to T, and let Dt* be the set of demands satisfied via T*. Then, 
for any tree component Tj of OPT, remove UtsT-Dt from D T . Finally, construct the uncontracted 

version of all the trees Tj, and add them to T. 

The first condition above clearly holds by the construction. For the second condition, note that 
at the beginning all the demands are satisfied by the trees Tj, and we only remove demands from 
Drp when they are satisfied elsewhere. The last condition is proved by contradiction. Suppose 
a tree T € T satisfies demands from two different groups T>\ and T>i- Clearly, T cannot be one 
of the contracted trees T*, since those trees only serve a subset of one £>j. Hence, T must be a 
tree component of OPT. Let u,v G V be two vertices from different components of F2 that are 
connected in T. Since u and v are not connected in F%, Lemma [8] ensures that for at least one of 
these vertices, say u, we have ys jU = VS B u, v. Thus, ys, u > implies v S. As T connects 
u and v, this means T exhausts the color u. We reach a contradiction, because the demands 
corresponding to u are still in Dt- 

Finally, we show that the cost of T is small. Recall that T consists of two kinds of trees: tree 
components of OPT as well as trees corresponding to exhausted colors of OPT. We claim the cost 
of the latter is at most eOPT. Let L be the set of exhausted colors of OPT. The cost of the 
corresponding trees is e^^ei 0f definition of <j> v . Lemma [TPl however, gives OPT > J2 v eL <^>- 
Therefore, the total cost of T is at most (1 + e)OPT. □ 
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Now, we are ready to prove the main theorem of this section. 

Proof of Theorem The first condition of the lemma follows directly from our construction: we 
start with a solution, and never disconnect one of the tree components in the process. The con- 
struction immediately implies the second condition. By Lemma [7J the cost of F2 is at most 
2*l2veV — 7OPT. Thus, F costs no more than (4/e + 2)OPT, giving the third condition. 
Finally, Lemma [TT1 establishes the last condition. □ 



4 Constructing the spanner 

The goal of this section is to prove Theorem [2l Recall that we are given a graph Gi n (Vi n , Ei n ), 
and a set of demands T>. From Theorem [31 we obtain a set of trees {T\ } . . . , associated with a 
partition of demands {£>i, . . . ,£>&}: tree Tj connects all the demands T>i, and the total cost of trees 
Ti is in O(OPT). The construction goes along the same lines as that of Borradaile et al. [BKMK07J, 
yet there are certain differences, especially in the analysis. The construction is done in four steps. 
We separately build a graph Hi for each Tj, and finally let H be the union of all graphs Hi. 

Step 1: cutting the graph open Construct the graph Gj as follows. Start from Gi n and 
duplicate each edge in Tj. Let £i be the Eulerian tour of the duplicated Tj. Introduce multiple 
copies of vertices in Tj if necessary, so as to transform Si into a simple cycle enclosing no vertices. 
We next change the embedding so that this cycle is the infinite face of the graph. It is clear that 
the length of the boundary of G\ is twice the length of Tj. 

Step 2: building panels Borradaile et al. pick a subgraph Gf of Gj which contains the boundary 
of Gj, such that its length is at most /i(e)OPT^(Gj n ), and partitions G\ into panels. The panels 
are the finite faces of G\. We describe the details of this step below. 

We first decompose G\ into strips. Let Q be the boundary of Gj. Let Q[x, y] denote the unique 
nonempty counterclockwise x-to-y subpath of Q. We define Q[x, x] := Q. We use a recursive 
algorithm: Find vertices x,y on Q such that(f| 

• (1 + e) dist G i (x, y) < £(Q[x,y]), and 

• (1 + e) dist G i (a/, y') > £(Q[x' , y']) for every x' , y' in Q[x, y] such that x' 7^ x or y' ^ y. 

Let B be a shortest path from x to y in Gj. Then, the subgraph enclosed by Q[x,y] U B is called 
a strip. We call the path B the blue boundary of the strip, whereas the path Q[x,y] is called the 
red boundary of the strip, and is denoted by R. The blue and red boundaries are thought of as the 
upper and lower boundaries, respectively. Then, both boundaries are oriented from left to right for 
the following discussion. 

Recursively decompose the subgraph Gj enclosed by BUQ— Q[x, y] into strips (if it is nontrivial). 
The next lemma is easy to prove. 

Lemma 12 (Inequality (10), Klein [Kle06] ) . The total length of all the boundary edges of all the 
strips is at most (e" 1 + 1) times the length of Q. 

3 This is always possible, since x = y satisfies these conditions. 
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After decomposing G\ into strips, we find short paths — called columns henceforth — crossing 
each strip. Consider a strip, and let R and B be its red and blue boundaries. 

Vertices ro, n, . . . are inductively selected as follows. Let tq be the left endpoint common to R 
and B. For j = 1,2,... , find the vertex rj on R such that 

• (1 + e) dist G i (r*j, B) < dist n(rj,rj-{) + dist G i (Vj-i, B), and 

• (1 + e) dist G i (x, B) > distf^x, rj-i) + dist G i(rj_i, B) for every x in R\rj-x,rj). 

For j = 0, 1, 2, ... , column Cj is defined to be a shortest path from r,- to B. Note that Co is a 
path with no edges since ro belongs to B. We also include as a column the no-edge path starting 
and ending at the rightmost vertex common to R and B. The total length of the columns can be 
bounded as follows. 

Lemma 13 (Lemma 5.2 of Klein [Kle06]). The sum of the lengths of the columns of a strip with red 
boundary R is at most e~ l £(R). 

Roughly speaking, for an integer parameter a = a(e) whose precise value will be fixed later, we 
select every a th column of each strip to finish the construction of Gf. In particular, let Co, Ci, . . . , C s 
be the set of columns of a strip. For each i = 0, 1, . . . , a, define the subset of columns Cj := {Cj : j = 
i (mod a)}. We select a group Ci whose total length is the least. The selected columns are called 
supercolumns. Clearly, Lemma [131 implies the length of the supercolumns is at most e _1 /a • £(R). 

The supercolumns partition each strip into panels. Each panel is bounded above by a blue 
boundary, and below by a red boundary. It is also bounded to the left and right by columns — recall 
that each strip is assumed to have trivial columns of length zero at its extreme left and right. The 
value a will be sufficiently large such that we may add all the supercolumns to the optimal solution 
without significant increase in length. 

Step 3: selecting portals They next select about h = h(e) vertices on the red and blue bound- 
aries of each panel as its portals. Denote the set of portals by II. These are picked such that they 
are almost equidistant. 

Consider a panel P, enclosed by Ci, B,C2, R. Define r := ■ The left-most and right- 

most vertices of R and B are designated portals. In addition, we designate a minimal set of vertices 
of R and B as portals, such that the distance between two consecutive portals on R (or B) is at most 
r. Ignoring the right-most (and left-most) portals, the minimality property guarantees the odd- 
numbered portals (even-numbered portals, respectively) have distance larger than r. Therefore, 
there are at most 2h + 4 = 0(h) portals on each panel 

Step 4: adding Steiner trees For each panel P, and any selection of its portals IT C II, we 
add the optimal Steiner tree spanning II' and only using the boundary or the inside of P. This is 
done assuming that the left and right borders of the panel have length zero. This can be done in 
time polynomial in h(e) using the algorithm of Erickson et al. [EMV87 ] , since all these terminals 
lie on the boundary face of a planar graph. Notice that for a fixed e, there are at most a constant 
number of such Steiner trees, and the length of each is at most the length of the boundary of the 
panel. 

The spanner Hi consists of C?, the Steiner trees built in the previous step, and edges of length 
zero connecting copies of a vertex on the infinite face. It is easy to observe the latter edges could 
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be added in such a way as to respect the planar embedding. The following lemma is the main piece 
in proving the shortness property. 

Lemma 14. Length of Hi is at most /(e)^(Tj) for a universal certain function /(e). 

Proof. By Lemma [T2l the sum of the lengths of the red boundaries of all strip boundaries is at most 
(e" 1 + l)£(Q). We can then apply Lemma [JJ to bound the total length of G\ by (e^ 1 + 1) 2 £(Q). 
Since the length of each Steiner tree added in the last step is at most the length of the boundary 
of the panel, the total length of Steiner trees added is at most 2°W times the sum of the length of 
all panels. The sum of the lengths of the panels is twice the length of Gf. Thus, the length of Hi 
is at most (2°^ + 1)£(G 2 ). We finally notice that the length of the boundary Q of G\ is 2£(Ti), to 
conclude with the desired bound. □ 

The spanning property is more involved to prove. We first mention a lemma proved in [BKM07J 
that states a few portal vertices are sufficient on each panel. 

Lemma 15 (Theorem 4 of |BKM07Q . Let P be a panel enclosed by C ll B,C 2 ,R. Further, let F be 
a forest strictly enclosed by the boundary of P. There is a forest F of P with the following properties: 

1. If two vertices on R U B are connected in F, then they are also connected in F U C\ U C 2 . 

2. The number of vertices in R U B that are endpoints of edges in F — (R U B) is at most a(e). 

3. £{F) < (l + ce)£(F). 

Here c is an absolute constant and a(e) = o(e -5,5 ). 

We are now at a position to prove the spanning property of H . Recall that H is formed by the 
union of the graphs Hi constructed above. 

Lemma 16. OPT v (H) < (1 + c'e)OPT v (G in ). 

Proof. Take the optimal solution OPT. Find forests OPTj satisfying demands T>i, i.e., £(OPTj) = 
OPT Vi (G in ). We can apply TheoremEJto get £^(OPT;) < (1 + e)OPT. 

Consider one OPTj that serves the respective set of demands T>i. Add the set of all supercolumns 
of Hi to OPTj to get OPT*. Recall the total length of these supercolumns is at most £{G 2 )/k. 
Pick q := e 2 /2(e _1 + l) 2 to bound the supercolumns cost by e 2 £(Ti). Next, use Lemma [TBI to 
replace the intersection of OPTj and each panel with another forest having the properties of the 
lemma. Let OPT 2 be the new forest. The cost of the solution increases by no more than ce£(OPTj). 
Furthermore, as a result, OPT 2 crosses each panel at most a(e) times. We claim, provided that 
h is sufficiently large compared to a, we can ensure that moving these intersection points to the 
portals introduces no more than an e factor to the cost. 

Consider a panel P, with boundaries C\,B, C 2 , R. Connect each intersection point of the panel 
to its closest portal. Each connection on a panel P moves by at most t(P). (In fact, it is at most 
t(P)/2.) The total movement of each panel is at most ar(P) = a(£(R) + £(B))/h. Hence, the total 
additional cost for all panels of Hi is no more than a/h ■ £(Gf) = a/h ■ (e _1 + l) 2 ^(Tj). We pick 
h(e) := e 2 a(e)(e~ 1 + l) 2 . Thus, the addition is at most e 2 £(T;). 

Finally, we replace the forests inside each panel P by the Steiner trees provisioned in the last 
step of our spanner construction. Take a panel P with the set of portals IL Let Ki,K 2 , ... be 
the connected components of OPT 2 inside P. Each intersection point is connected to a portal of 
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P. Replace each Kj by the optimal Steiner tree connecting corresponding to this subset of portals. 
This procedure does not increase the length and produces a graph OPT?. 

Clearly, OPT? satisfies all the demands in Dj. Thus, the union of all forests OPT?, henceforth 
referred to as OPT*, gives a solution for the given Steiner forest instance. It only remains to bound 
the cost of OPT*. We know that 

£(OPT*) < ^OPT^, (4) 

i 

which may be strict due to common edges between different OPT? forests. Replacing the Steiner 
trees by the optimal Steiner trees between portals cannot increase the cost, so, the only cost increase 
comes from connections to portals. Thus, we get 

^(OPT^ + e 2 ^)). (5) 

i 

From the above discussion, 

£(OPT 2 ) < (1 + ce)£{OPT}), by Lemma [El and (6) 

£(OPT*) < ^(OPTi) + e 2 £(Ti) due to supercolumns' length. (7) 

Hence, the cost of OPT* is 

£(OPT*) < [(1 + ce ) 0PT * + e2 ( 2 + c)£{Ti)] by ©, ® and © 

i 

= J2 [(1 + ce)OPT t ] + [^(2 + c)l{T<)] 

i i 

< (1 + ce)(l + e)OPT + e 2 (2 + c) J^(Ti) by Theorem! 

i 

< (1 + ce)(l + e)OPT + e 2 (2 + c)(4/e + 2)OPT by Theorem[3] 
= [1 + (c + l)e + e 2 + (2 + c)(4e + 2e 2 )] OPT 

< (l + c'e)OPT, 

if we pick d < (c + l) + e + (2 + c)(4 + 2e). This can be achieved via d := 14 + 7c assuming e < 1. □ 

Runnig time In the above proof, d is a fixed constant independent of e. It can be easily verified 
that h and a have polynomial dependence on e. Therefore, /(e) has a singly exponential dependence 
on e. Borradaile et al. show how to carry out the above steps in time 0(n log n) for each Hi, with 
the contant having singly exponential dependence on e. There are at most n such subgraphs, hence, 
the running time of this step is 0(n 2 logn). Furthermore, Algorithm Q] can be implemented to run 
in time 0(n 2 logn). We do not explicitly store all the ys lV variables; rather, we only keep track of 
those which are relevant in the algorithm. The growth stage has 0(n) iterations, and as a result, 
the set S will have size 0{n). A simple implementation guarantees 0(n 2 logn) time for the growth 
step; see |GW95| . The pruning stage can also be implemented with a similar time bound: We 
maintain a priority queue for the clusters in B — the key for each cluster S is |i ? ri(5(S')|. Notice that 
there are at most 0(n) iterations in the while loop. In each iteration, we perform 0(n) updates 
to the clusters of the queue as a result of removing an edge. 

Proof of Theorem^ Immediate from Lemmas [14] and [16] if we use e/c instead of e in the above 
construction. □ 
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5 The PTAS for planar Steiner forest 



Having proved the spanner result, we can present our main PTAS for Steiner forest on planar 
graphs in this section. We first mention two main ingredients of the algorithm. We invoke the 
following result due to Demaine, Hajiaghayi and Mohar [DHM07]. 

Theorem 17 ([DHM07J). For a fixed genus g, and any integer k > 2, and for every graph G of Euler 
genus at most g, the edges of G can be partitioned into k sets such that contracting any one of the 
sets results in a graph of treewidth at most 0(g 2 k). Furthermore, such a partition can be found in 

0(g 5 / 2 n 3 / 2 logn) time. 

As a corollary, this holds for a planar graph (which has genus 1). 

We can now prove the main theorem of this paper. Algorithm [2] (PSF-PTAS) shows the steps 
of the PTAS. 

Algorithm 2 PSF-PTAS 

Input: planar graph Gi n (Vi n , Ei n ), and set of demands V. 
Output: Steiner forest F satisfying T>. 

1: Construct the Steiner forest spanner H. 

2: Let k <- 2/(e)/e. 

3: Let e «— min(l,e/6). 

4: Using Theorem [TTJ, partition the edges of H into E%, . . . ,Ek. 
5: Let i* <— argminj t(Ei). 

6: Find a (1 + e)-approximate Steiner forest F* of T> in H/Ei* via Theorem [5) 
7: Output F* U Ei* . 



Proof of Theorem^ Given are planar graph G{ n , and the set of demand pairs T>. We build a 
Steiner forest spanner H using Theorem [2j For a suitable value of k whose precise value will be 
fixed below, we apply Theorem 1171 to partition the edges of H into E±, Ei, ■ ■ ■ , E/.- Let E^ be the 
set having the least total length. The total length of edges in Ei* is at most 1(H) /k. Contracting 
E{* produces a graph H* of treewidth 0(k). 

Theorem [5] allows us to find a solution OPT* corresponding to H* . Adding the edges Ei* clearly 
gives a solution for H whose value is at most (1 + e)OPTx>(^) + i(H)/k. Letting e = min(l, e/6) 
and k = 2/(e)/e guarantees that the cost of this solution is 



The running time of all the algorithm except for the bounded-treewidth PTAS is bounded by 
0(n 2 logn). The parameter k above has a singly exponential dependence on e. Yet, the running 
time of the current procedure for solving the bounded-treewidth instances is not bounded by a 
low-degree polynomial; rather, k amd e appear in the exponent in the polynomial. Were we able to 
improve the running time of this procedure, we would obtain a PTAS that runs in time 0(n 2 log n). 



< (l + e) 2 OPT v (G in )+e(H)/k 

< (1 + e) 2 OPT c (G m ) + |0PT 7? (G i „) 
(1 + e)OPT v (G in ) 



by Theorem [2] 
by Theorem [2] and choice of k 



by the choice of e. 



□ 
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5.1 Extension to bounded-genus graphs 

We can generalize this result to the case of bounded-genus graphs. Theorem [3] does not assume 
any special structure for the input graph. The ideas of Section U] can be generalized to the case 
of bounded-genus graphs similarly to the work of Borradaile et al. [BDT09]. This process does 
not increase the Euler genus of the graph, since the resulting graph has a subset of original edges. 
Theorem 1171 works for such graphs as well, and hence, as in Theorem [IJ we can reduce the problem 
to a bounded-treewidth graph on which we apply Theorem [5j 

Theorem 18. For any constant e > 0, there is a polynomial-time (1 + e)-approximation algorithm for 
the graphs of Euler genus at most g. 

6 PTAS for graphs of bounded treewidth 

The purpose of this section is to prove Theorem [5] by presenting a PTAS for Steiner forest on 
graphs of bounded treewidth. 

6.1 Groups 

We define a notion of group that will be crucial in the algorithm. A group is defined by a set S of 
center vertices, a set X of "interesting" vertices, and a maximum distance r; the group Qq(X, S,r) 
contains S and those vertices of X that are at distance at most r from some vertex in S. 

Lemma 19. Let T be a Steiner tree of X C V(G) with cost W. For every e > 0, there is a set S C X 
of 0(1 + 1/e) vertices such that X = Q G (X, S, eW). 

Proof. Let us select vertices sx, s 2 , ... from X as long as possible, with the requirement that the 
distance of Sj is more than eW from every Sj, 1 < j < i. Suppose that st is the last vertex selected 
this way. We claim that t < 1 + 2/e. Consider a shortest closed tour in G that visits the vertices 
s\, . . . , St- As the distance between any two such vertices is more than eW, the total length of the 
tour is more than teW (assuming that t > 1). On the other hand, all these vertices are on the tree 
T and it is well known that there is a closed tour that visits every vertex of the tree in such a way 
that every edge of the tree is traversed exactly twice and no other edge of the graph is used. Hence 
the shortest tour has length at most 2W and t < 2/e follows. □ 

The following consequence of the definition of group is easy to see: 

Proposition 20. If Sx,S 2 ,Xx,X 2 are subsets of vertices and rx,r 2 are real numbers, then 

G G (Xx , Sx , n ) U Q G (X 2 , S 2 , r 2 ) C Q G [X x U X 2 , Sx U S 2 , max{r x , r 2 } ) . 

6.2 Conforming solutions 

Let B = (Bi)i=x... n be the bags of a rooted nice tree decomposition of width k. Let Vi be the set 
of vertices appearing in B{ or in a descendant of B^. Let Ai be the set of active vertices at bag 
Bi~. those vertices v € Vi for which there is a demaind {v,w} € V with w Vi. Let Gi := G[Vi\. 
A Steiner forest F induces a partition iii{F) of Ai for every i = 1, . . . ,n: let two vertices of Ai be 
in the same class of %i(F) if and only if they are in the same component of F. Note that if F is 
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restricted to Gi, then a component of F can be split into up to k + 1 components, thus 7Tj(F) is a 
coarser partition then the partition defined by the components of the restriction of F to Gi. 

Let II = (IIj)j = i... n be a collection such that 11, is a set of partitions of Ai. If Hi{F) G 11, for every 
bag Bi, then we say that F conforms to II. The aim of this subsection is to give a polynomial-time 
algorithm for bounded treewidth graphs that finds a minimum cost solution conforming to a given 
n (for fixed k, the running time is polynomial in the size of the graph and the size of the collection 
LT on graph with treewidth at most k). In Section [6.3^ we construct a polynomial-size collection II 
such that there is a (1 + e)-approximate solution that conforms to II. Putting together these two 
results, we get a PTAS for the Steiner forest problem on bounded treewidth graphs. 

Lemma 21. For every fixed k, there is a polynomial time algorithm that, given a graph G with 
treewidth at most k and a collection II, finds the minimum cost Steiner forest conforming to II. 

The proof of Lemma [21] follows the standard dynamic programming approach, but it is not 
completely trivial. First, we use a technical trick that makes the presentation of the dynamic pro- 
gramming algorithm simpler. We can assume that every terminal vertex v has degree 1: otherwise, 
moving the terminal to a new degree 1 vertex v' attached to v with an edge vv ' having length does 
not change the problem and does not increase treewidth. Thus by LemmaEl it can be assumed that 
we have a nice tree decomposition (T, B) of width at most k where no terminal vertex is introduced 
and the join nodes contain no terminal vertices. For the rest of the section, we fix such a tree 
decomposition and notation Vi, Ai, etc. refer to this fixed decomposition. 

Let us introduce terminology and notation concerning partitions. A partition a of a set S can 
be considered as an equivalence relation on S. Hence we use notation (x, y) G a to say that x and 
y are in the same class of a. We denote by x a the class of a that contains element x. 

If F is a subgraph of G and S C V(G), then F induces a partition a of S: (x, y) G a if and only 
if x and y are in the same component of F (and every x G S \ V{F) forms its own class). We say 
that partition a is finer than partition (3 if (x, y) G a implies (x, y) G f3; in this case, [3 is coarser 
than a. We denote by a>\ V ai the unique finest partition a coarser than both a\ and a%. This 
definition is very useful in the following situation. Let Fx, Fi be subgraphs of G, and suppose that 
Fx and Fi induces partitions a\ and ai of a set S C V(G), respectively. If Fx and F2 intersect 
only in S, then the partition induced by the union of Fx and F2 is exactly otx V a.%. Let /3j be a 
partition of Bi for some i G I and let Fj be a subgraph of Gr[l^]. Then we denote by Fj + /3j the 
graph obtained from Fj by adding a new edge xy for every (x, y) G Pi. 

Following the usual way of designing algorithms for bounded-treewidth graphs, we define several 
subproblems for each node i G I. A subproblem at node i corresponds to finding a subgraph Fj 
in Gi satisfying certain properties: F, is supposed to be the restriction of a Steiner forest F to Vi. 
The properties defining a subproblem prescribe how Fj should look like form the "outside world" 
(i.e., from the part of G outside Vi) and they contain all the information necessary for deciding 
whether Fj can be extended, by edges outside Vi, to a conforming solution. Let us discuss briefly 
and informally what information these prescriptions should contain. Clearly, the edges of Fj in Bi 
and the way Fj connects the vertices of Bi (i.e., the partition a of Bi induced by Fj) is part of 
this information. Furthermore, the way Fj partitions Ai should also be part of this information. 
However, there is a subtle detail that makes the description of our algorithm significantly more 
technical. The definition of vrj(F) = tt means that the components of F partition Ai in a certain 
way. But the restriction Fj of F to Vj might induce a finer partition of Ai than tt: it is possible 
that two components of Fj are in the same component of F. This means that we cannot require 
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that the partition of A4 induced by Fi belongs to II. We avoid this problem by "imagining" the 
partition (3 of Bi induced by the full solution F, and require that Fi partition Ai according to ir 
if each class of (3 becomes connected somehow. In other words, instead of requiring that Fi itself 
partitions Ai in a certain way, we require that Fj + (3 induces a certain partition. 
Formally, each subproblem P is defined by a tuple (i, H, tt, a, (3, /x), where 

(51) % 6 I is a node of T, 

(52) H is a spanning subgraph of G[Bi] (i.e., contains all vertices of G[Bi]), 

(53) 7T £ Ilj is a partition of Ai, 

(54) a, (3 are partitions of Bi and (3 is coarser than a, 

(55) \x is an injective mapping from the classes of 7Tj to the classes of (3. 

The solution c(i,H,ir,a, j3, /x) of a subproblem P is the minimum cost of a subgraph Fi of G[Vi] 
satisfying all of the following requirements: 

(CI) Fi[Bi) = H (which implies Bi C V(F f )). 

(C2) a is the partition of Bi induced by Fi. 

(C3) The partition of A4 induced by Fi + (3 is 7r. 

(C4) For every descendant d of i, the partition of Ad induced by F, L + j3 belongs to ILi. 

(C5) If there is a terminal pair {x\, X2) with x%,X2 € Vi, then they are connected in Fi + (3. 

(C6) Every x £ Aj is in the component of i*i + (3 containing fi(x v ). 

We solve these subproblems by bottom-up dynamic programming. Let us discuss how to solve 
a subproblem depending on the type of i. 

Leaf nodes i. If i is a leaf node, then value of the solution is trivially 0. 

Join node i having children i\, i<i- Note that A^ and Ai 2 are disjoint: the vertices of a join 
node are not terminal vertices. The set Ai is a subset of Ai 1 U Ai 2 and it may be a proper subset: 
if there is a pair {x, y) with x E Ai x , y G Ai 2 , then x or y might not be in Ai. 

We show that the value of the subproblem is 

c (h H, 7r, a, (3, n) = min (c(i x , H, n 1 , a 1 , 0, /x 1 ) + c{i 2 , H,ir 2 ,a 2 , (3, fi 2 ) - £{H)), (8) 

( Jl),( J2),( J3),( J4) 

where the minimum is taken over all tuples satisfying, for p = 1,2, all of the following conditions: 
(Jl) q 1 V a 2 = a. 

(J2) 7r and ir p are the same on Ai p n vlj. 
(J3) For every t> G Ai p n Ai, ^(v 7 ") = fj?(v wP ). 

(J4) If there is a terminal pair (xi, X2) with x\ G Ai and X2 € A2, then /i 1 (x] 1 " 1 ) = /^(xf 2 ). 

We will use the following observation repeatedly. Let F be subgraph of Gi and let F v = F\Vi p ]. 
Suppose that F induces partition a on Bi and (3 is coarser than a. Then two vertices of Vi p are 
connected in F + (3 if and only if they are connected in F p + (3. Indeed, F 3 ~ p does not provide any 
additional connectivity compared to F p : as (3 is coarser than a, if two vertices of Bi are connected 
in F 3 ~ p , then they are already adjacent in F p + f3. 

Proof of ([8]) Ze/t < © ri^t: 
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Let Pi = (ii,H, 7r j a , f3, /J, 1 ) and P2 = H, tt 2 , a 2 , {3, fj?) be subproblems minimizing the 
right hand side of ([8]), let F 1 and F 2 be optimum solutions of Pi and P2, respectively. Let F be 
the union of subgraphs F 1 and F 2 . It is clear that the cost of F is exactly the right hand side of 
( | 8 j) : the common edges of F 1 and F 2 are exactly the edges of H. We show that F is a solution of 
P, i.e., F satisfies requirements (C1)-(C6). 
(CI): Follows from F 1 [P>i] = F 2 [Bj\ = F[Bj\ = H. 

(C2): Follows from (Jl) and from the fact that F 1 and F 2 intersect only in Bi. 
(C3): First consider two vertices x, y G Ai p n A4. Vertices x and y are connected in F + /3 if and only 
if they are connected in F p + (3, which is true if and only if (x, y) G tt p holds, which is equivalent 
to (x,y) G tt by (J2). Now suppose that x G n vlj and y G Aj 2 n Aj. In this case, x and y are 
connected in F + /? if and only if there is a vertex of Bi reachable from x in F 1 + /3 and from y 
in F 2 + 0, or in other words, / u 1 (x 7r ) = ^(y 77 ). By (J3), this is equivalent to ^(x 71 ") = ^(x 71 '), or 
(x,y) G 7r (as fi is injective). 

(C4): If d is a descendant of i p , then the statement follows using that (C4) holds for solution F p 
of P v and the fact that for every descendant d of i p , Fi + (3 and F + (3 induces the same partition 
of A^. For d = i, the statement follows from the previous paragraph, i.e., from the fact that F + (3 
induces partition tt £ Ilj on ^. 

(C5): Consider a pair (xi,X2). If x\,X2 G or xi,X2 G Vi 2 , then the statement follows from (C5) 
on F 1 or F 2 . Suppose now that xi G and X2 G V^ 2 ; in this case, we have xi G A^ and X2 G Ai 2 . 
By (C6) on F 1 and F 2 , x p is connected to fi p (xf) in F p + (3. By (J4), we have fi 1 (xf ) = (x^ ) , 
hence xi and X2 are connected to the same class of (3 in F + (3. 

(C6): Consider an x G that is in Aj p . By condition (C6) on F p , we have that x is connected in 
F p + (3 (and hence in F + /3) to ^(v 71 ), which equals ^(f^) by (J3). 

Proof of (HD ?e/5 > © nyM: 

Let F be a solution of subproblem (i, H, tt, a, (3, /i) and let F p be the subgraph of F induced 
by Vi p . To prove the inequality, we need to show three things. First, we have to define two 
tuples (ii, H, tt , a , (3, /i ) and (12, H, tt 2 , a 2 , f3, /i 2 ) that are subproblems, i.e., they satisfy (Sl)- 
(S5). Second, we show that (J1)-(J4) hold for these subproblems. Third, we show that F 1 and 
F 2 are solutions for these subproblems (i.e., (C1)-(C6)), hence they can be used to give an upper 
bound on the right hand side that matches the cost of F. 

Let a p be the partition of Bi induced by the components of F p ; as F 1 and F 2 intersect only in 
Bi, we have a = a 1 V a 2 , ensuring (Jl). Since (3 is coarser than a, it is coarser than both a 1 and 
a 2 . Let tt p be the partition of Ai p defined by F + (3, we have tt v G IIj p by (C4) for F. Furthermore, 
by (C3) for F, tt is the partition of Ai induced by F + (3, hence it is clear that tt and tt p are the 
same on Ai p n Ai, so (J2) holds. This also means that F + (3 (or equivalently, F p + (3) connects a 
class of tt p to exactly one class of (3; let fi p be the corresponding mapping from the classes of ir p 
to (3. Now (J4) is immediate. Furthermore, it is clear that the tuple (i p , H, tt p , a p , (3, /i p ) satisfies 
(S1)-(S5). 

We show that F p is a solution of subproblem (i p , H, tt p , a p , (3, ^ p ). As the edges of H are shared 
by F 1 and F 2 , it will follow that the right hand side of (JH]) is not greater than the left hand side. 
(CI): Obvious from the definition of F 1 and F 2 . 
(C2): Follows from the way a p is defined. 

(C3): Follows from the definition of tt p , and from the fact that F + (3 and F p + (3 induces the same 
partition on A4 . 
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(C4): Follows from (C4) on F and from the fact that F + f3 and F p + [3 induces the same partition 
on A d . 

(C5): Suppose that xi,X2 G VL. Then by (C5) for F, x\ and xi are connected in F + /5, hence 
they are connected in F + (3 as well. 
(C6): Follows from the definition of ijP. 

Introduce node i of vertex v. Let j be the child of i. Since v is not a terminal vertex, we 
have Aj = Ai. Let F' be a subgraph of G[Vj] and let F be obtained from F by adding vertex v to 
F' and making f adjacent to S C .Bj. If a' is the partition of Bj induced by the components of F', 
then we define the partition a'[v, S] of Bi to be the partition obtained by joining all the classes of 
a' that intersect S and adding v to this new class (if 5 = 0, then {v} is a class of a'[v, 5]). It is 
clear that a'[S, v] is the partition of Bi induced by F. 

We show that the value of a subproblem is given by 

c(i, H, ir, a, (3, n) = min c(j, H[Bj], n, a, 0, 0) + l{xv), (9) 

(-fl).C2),(/3) xveE(H) 

where the minimum is taken over all tuples satisfying all of the following: 

(11) q = a'[v, S], where S is the set of neighbors of v in H. 

(12) f3' is (3 restricted to Bj. 

(13) For every x € Ai, fj,(x n ) is the class of (3 containing 0(x w ). 

Proof of (JU) left < (JU) right: 

Let F' be an optimum solution of subproblem P' = (j, H[Bj], ir, a 1 ', /?', //). Let F be the graph 
obtained from F' by adding to it the edges of H incident to v; it is clear that the cost of F is 
exactly the right hand side of ([9]). Let us verify that (C1)-(C6) hold for F. 
(CI): Immediate. 

(C2): Holds because of (II) and the way a'[u, 5] was defined. 

(C3)-(C5): Observe that F + (3 connects two vertices of Vj if and only if F' + (3' does. Indeed, if a 
path in F + (3 connects two vertices via vertex v, then the two neighbors x, y of v on the path are 
in the same class of (3 as v, hence (by (12)) x,y are in the same class of (3 1 as well. In particular, 
for every descendant d of i, the components of F + (3 and the components of F' + (3 give the same 
partition of A&. 

(C6): Follows from (C6) for F' and from (13). 
Proof of © left > © right- 
Let F be a solution of subproblem (i, H, ir, a, (3, n) and let F' be the subgraph of F induced by 
Vj. We define a tuple (j, H[Bj],ir, a' , , fi') that is a subproblem, show that it satisfies (II)— (13) , 
and that F' is a solution of this subproblem. 

Let a! be the partition of Ai = Aj induced by F' and let be the restriction of (3 on Bj] these 
definitions ensure that (II) and (12) hold. Let //(x^) defined to be the class of arising as the 
restriction of the class n(x w ) to Bj; clearly, this ensures (13). Note that this is well defined, as it is 
not possible that ^{x 11 ) is a class of (3 consisting of only v: by (C6) for F, this would mean that v 
is the only vertex of Bi reachable from x in F. Since v is not a terminal vertex, v ^ x, thus if v is 
reachable from x, then at least one neighbor of v has to be reachable from x as well. 
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Let us verify that (S1)-(S5) hold for the tuple (j, H[Bj], tt, a', P' , /j,'). (SI) and (S2) clearly 
holds. (S3) follows from the fact that (C4) holds for F and Ai = Aj. To see that (S4) holds, 
observe that (x,y) £ a' implies (x,y) £ a, which implies (x,y) £ (3, which implies (x,y) £ (3'. (S5) 
is clear from the definition of //. 

The difference between the cost of F and the cost of F' is exactly Y^xveEfH) @{ xv )- Thus to 
show that the left of ([9]) is at most the right of Q, it is sufficient to show that F' is a solution of 
subproblem (j, H[Bj],ir, a' , , //). 
(CI) and (C2): Obvious. 

(C3)-(C5): As in the other direction, follows from the fact that F' + f3' induces the same partition 
of Aj = Ai as F + p. 

(C6): By the definition of /j,', it is clear that x is connected only to //(a;* - ) in F + (3 and hence in 
F' + p. 

Forget node i of vertex v. Let j be the child of i. We have Vi = Vj and hence Ai = Aj. We 
show that the value of a subproblem is given by 

c(i, H, tt, a, 8, u) = min cij, H' tt, a' B' uf), (10) 

V ^ (F1),(F2),(F3),(F4) W ' ' ' v ' 

where the minimum is taken over all tuples satisfying all of the following: 
(Fl) H'[Bi) = H. 

(F2) a is the restriction of a' to Bi. 

(F3) 3 is the restriction of P' to Bi and (x, v) € P' if and only if (x, v) £ a' . 

(F4) For every x € Ai, ^(x^) is the (nonempty) set fj,'(x n )\{v} (which implies that ^'(x 71 ) contains 
at least one vertex of Bi). 

Proof of (HO]) left < <HQ|| right: 

Let F be a solution of (j, H', tt, a', P', fj,'). We show that F is a solution of (j, H, tt, a, P, /i) as 
well. 

(CI): Clear because of (Fl). 
(C2): Clear because of (F2). 

(C3)-(C5): We only need to observe that F + p and F + P' have the same components: since by 
(F3), (x,v) G P' implies (x,v) £ a', the neighbors of v in F + P are reachable from v in F, thus 
F + P' does not add any further connectivity compared to F + p. 

(C6): Observe that if //(x 71 ^ are the vertices of Bj reachable from x in F + P' , then fi'(x n ) \ {v} 
are the vertices of Bi reachable form F + p. We have already seen that F + P and F + P' have 
the same components, thus the nonempty set n(x w ) is indeed the subset of Bi reachable from x in 
F + p. Furthermore, by (F3), P is the restriction of P' on Bi, thus if ^'(x 71 ) is a class of then 
n(x w ) is a class of p. 

Proof of (PH Ze/S > {TO} ri 5 /ii: 

Let F be a solution of (j, if, 7r, a, 8, //). We define a tuple (j, iJ', 7r, a' , P' , fi') that is a subprob- 
lem, we show that (F1)-(F3) hold, and that F is a solution of this subproblem. 

Let us define H' = F[Bj] and let a' be the partition of Bj induced by the components of F; these 
definitions ensure that (Fl) and (F2) hold. We define P' as the partition obtained by extending 
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(3 to Bj such that v belongs to the class of (3 that contains a vertex x G Bi with (x,v) G a' (it 
is clear that there is at most one such class; if there is no such class, then we let {v} be a class 
of (5'). It is clear that (F3) holds for this /?'. Let us note that F + (3 and F + (3' have the same 
connected components: if (x,v) G (3', then x and v are connected in F. Let p'ix^) be the subset 
of reachable from x in F + /?' (or equivalently, in F + (3). It is clear that p(x n ) = p'(x n ) \ {v'} 
holds, hence (F4) is satisfied. 

Let us verify first that (S1)-(S5) hold for (j, H' , tt, a', ft, //). (SI) and (S2) clearly holds. (S3) 
follows from the fact that (S3) holds for (i, H, tt, a, [3, p) and A- L = Aj. To see that (S4) holds, 
observe that if x, y G Bi, then (x, y) G a' implies (x, y) G a, which implies (x, y) G [3, which implies 
(x,y) G (3'. Furthermore, if (x,v) G a', then by (x,v) G (3' by the definition of [3. (S5) is clear from 
the definition of p! . 

We show that F is a solution of (j, H',ir, ol ,(3' , p!). 
(CI): Clear from the definition of H'. 
(C2): Clear from the definition of a'. 

(C3)-(C5): Follow from the fact that F + (3 and F + (3' have the same connected components. 
(C6): Follows from the definition of p! . 

6.3 Constructing the partitions 

Recall that the collection II contains a set of partitions IT for each i S /. Each partition in IF is 
defined by a sequence ((-Si, ri), . . . , (S p , r p )) of at most k + 1 pairs and a partition p of {1, . . . ,p}. 
We will denote by p(j) the class of the partition containing j. The pair (Sj,rj) consists of a set 
Sj of 0((k + 1)(1 + 1/e)) vertices of G{ and a nonnegative integer rj, which equals the distance 
between two vertices of G. There are at most \V(G)\°^ k+1 ^ 1+1 ^^ ■ \V(G)\ 2 possible pairs {Sj,rj) 
and hence at most \V (G)\°^ k+1 ^ different sequences. The number of possible partitions p 

is 0(k k ). Thus if we construct Ilj by considering all possible sequences constructed from every 
possible choice of (Sj,rj), the size of Ilj is polynomial in |F(G)| for every fixed k and e. 

We construct the partition tt corresponding to a particular sequence and p the following way. 
Each pair (Sj,Vj) can be used to define a group Rj = Qc{Ai, Sj,rj) of A{. Roughly speaking, for 
each class P of p, there is a corresponding class of it that contains the union of Rj for every j G P. 
However, the actual definition is somewhat more complicated. We want tt to be a partition, which 
means that the subsets of Ai corresponding to the different classes of p should be disjoint. In order 
to ensure disjointness, we define Rj := Rj \ [j^i Rj' ■ The partition tt of Ai is constructed as 
follows: for each class P of p, we let UjeP ^ e a c ^ ass °f 71 • Note that these classes are disjoint by 
construction. If these classes fully cover Ai, then we put the resulting partition tt into IL; otherwise, 
the sequence does not define a partition. This finishes the construction of ITj. 

Before showing that there is a good approximate solution conforming to the collection II defined 
above, we need a further definition. For two vertices u and v , we denote by u < v the fact that 
the topmost bag containing u is a proper descendant of the topmost bag containing v. Note that 
each bag is the topmost bag of at most one vertex in a nice tree decomposition (recall that we 
can assume that the root bag contains only a single vertex). Thus if u and v appear in the same 
bag, then u < v or v < u holds, i.e., this relation defines an ordering of the vertices in a bag. We 
can extend this relation to connected subset of vertices: for two disjoint connected sets K\, K2, 
K\ < K2 means that K2 has a vertex v such that u < v for every vertex u G K\, in other words, 
K\ < K2 means that the topmost bag where vertices from K\ appear is a proper descendant of 
the topmost bag where vertices from K2 appear. If there is a bag containing vertices from both K\ 
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and K2, then either K\ < K2 or K2 < K\ holds. The reason for this is that the bags containing 
vertices from K\ U K2 form a connected subtree of the tree decomposition, and if the topmost bag 
in this subtree contains vertex v € K\ U K2 , then u < v for every other vertex u in K\ U K2 ■ 

Lemma 22. There is a (1 + /ce)-approximate solution conforming to II. 

Proof. Let F be a minimum cost Steiner forest. We describe a procedure that adds further edges 
to F to transform it into a Steiner forest F' that conforms to II and has cost at most (1 + ke)£(F). 
We need a delicate charging argument to show that the total increase of the cost is at most ke ■ 1(F) 
during the procedure. In each step, we charge the increase of the cost to an ordered pair (K\, K2) 
of components of F. We are charging only to pairs (K\,K2) having the property that K\ < K2 
and there is a bag containing vertices from both K\ and K2 ■ Observe that if Bi is the topmost bag 
where vertices from K\ appear, then these properties imply that a vertex of K2 appears in this bag 
as well. Otherwise, if every bag containing vertices of K2 appears above Bi, then there is no bag 
containing vertices from both K\ and K% if every bag containing vertices from K2 appears below 
Bi, then K\ < K2 is not possible. Thus a component K\ can be the first component of at most k 
such pairs {K\,K%); since the components are disjoint, the topmost bag containing vertices from 
K\ can intersect at most k other components. We will charge a cost increase of at most e • 1{K\) 
on the pair [K\,K.2), thus the total increase is at most ke ■ £(F). It is a crucial point of the proof 
that we charge on (pairs of) components of the original solution F, even after several modification 
steps, when the components of F' can be larger than the original components of F. Actually, in 
the proof to follow, we will refer to three different types of components: 

(a) Components of the current solution F' . 

(b) Each component of F' contains one or more components of F. 

(c) If a component of F is restricted to the subset Vi, then it can split into up to fe+1 components. 

To emphasize the different meanings, and be clear as well, we use the terms a-component, b- 
component, and c-component. 

Initially, we set F' := F and it will be always true that F' is a supergraph of F, thus F' defines 
a partition of the b-components of F. Suppose that there is a bag Bi such that the partition iri(F') 
of Ai induced by F' is not in Ilj. Let K\ < K2 < ■ ■ ■ < K p be the b-components of F intersecting 
-Bj, ordered by the relation <. Some of these b-components might be in the same a-component of 
F'; let p be the partition of {1, . . . ,p} defined by F' on these b-components. 

Let Aij be the subset of Ai contained in Kj. The intersection of b-component Kj with Vi gives 
rise to at most k + 1 c-components, each of cost at most i(Kj). Thus by Lemma [T9and Prop. [20l 
there is a set Sj C V(Kj) of at most 0((k + 1)(1 + 1/e)) vertices such that Aij = Qc^Aij, Sj,rj) 
for some rj < e • t(Kj). If the sequence (Si,r%), . . . , (S p ,r p ) and the partition p give rise to the 
partition m(F'), then iTi(F') € Ilj. Otherwise, let us investigate the reason why this sequence and 
p do not define the partition TVi(F'). There are two possible problems: either a set R'j arising in the 
construction is too small (i.e., A^j % R'j) or too large (i.e., Rj contains a vertex from some A^j* 
such that j and j* are not in the same class of p). If none of these problems arise, then it is clear 
that the constructed partition it is indeed a partition and it is the same as the partition iri(F'). 

Let j be the smallest integer such that Rj is too large or too small. By the choice of Sj, 
Rj = Qc^Ai, Sj,rj) contains Aij. If no Rj* with j* < j is too large, then no such Rj* contains 
vertices from Aij, which means that Rj := Rj \ iH-*=i Rj* f un Y contains Aij. Thus Rj is not too 
small. Suppose now that R'j is too large: it contains a vertex of Aij* for some j* p(j)- It is not 
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possible that j* < j: by assumption, R'- t is not too small, thus R'j* fully contains Aij* and hence 
R'j is disjoint from Aij*. Thus we can assume that j* > j. The fact that Qq^A-i, Sj,rj) intersects 
Aij* means that there is a vertex u G Sj and vertex v G Aij* such that da^u^v) < e • l(Kj). Note 
that u is a vertex of b-component Kj (as Sj C -Ai,-) and u is a vertex of b-component i^j*. We 
modify i 7 ' by adding a shortest path that connects u and t> . Clearly, this increases the cost of F' 
by at most e • l(Kj), which we charge on the pair (Kj,Kj*). Note that Kj and Kj* both intersect 
the bag Bi and Kj < Kj*, as required in the beginning of the proof. Furthermore, Kj and Kj* 
are in the same a-component of F' after the modification, but not before. Thus we charge at most 
once on the pair (Kj, Kj'). 

Since the modification always extends F' , the procedure described above terminates after a 
finite number of steps. At this point, every partition iTi(F') belongs to the corresponding set IT, 
that is, the solution F' conforms to II. □ 



7 Algorithm for series-parallel graphs 

A series-parallel graph is a graph that can be built using series and parallel composition. Formally, 
a series-parallel graph G(x, y) with distinguished vertices x, y is an undirected graph that can be 
constructed using the following rules: 

• An edge xy is a series-parallel graph. 

• If Gi(xi,yi) and ^2(3:2, 2/2) are series-parallel graphs, then the graph G(x,y) obtained by 
identifying x\ with X2 and y± with yi is a series-parallel graph with distinguished vertices 
x := x\ = X2 and y := y% = j/2 (parallel connection). 

• If Gi(xi,yi) and ^2(3:2, 2/2) are series-parallel graphs, then the graph G(x,y) obtained by 
identifying y\ with X2 is a series-parallel graph with distinguished vertices x := x\ and y := 2/2 
(series connection). 

We prove Theorem[5]in this section by constructing a polynomial-time algorithm to solve Steiner 
Forest on series-parallel graphs. It is well-known that the treewidth of a graph is at most 2 if and 
only if it is a subgraph of a series parallel graph [Bod98]. Since setting the length of an edge to 00 is 
essentially the same as deleting the edge, it follows that Steiner Forest can be solved in polynomial 
time on graphs with treewidth at most 2. 

Let (G,T>) be an instance of Steiner Forest where G is series parallel. For % = 1, ... ,m, denote 
by Gi(xi,yi) all the intermediary graphs appearing in the series-parallel construction of G. We 
assume that these graphs are ordered such that G = G m and if Gi is obtained from Gj t and Gj 2 , 
then j\,j2 < i- Let T>i C T> contain those pairs {u,v} where both vertices are in V(Gi). Let Ai 
be those vertices v € V(Gi) for which there exists a pair {v,u} € T> with u V(Gi) (note that 
A m = and V m = T>) . For every Gi , we define two integer values aj , bi and a function /j : 

• Let Ui be the minimum cost of a solution F of the instance (Gi,T>i) with the additional 
requirements that Xi and yi are connected in F and every vertex in Ai is in the same component 
as Xi and yi. 

• Let G\ be the graph obtained from Gi by identifying vertices Xi and yi. Let bi be the minimum 
cost of a solution F of the instance {G' i ,T>i) with the additional requirement that Ai is in the 
same component as Xi = y^. 
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• For every S C Ai, let fi(S) be the minimum cost of a solution F of the instance (Gi,T>i) 
with the additional requirements that x% and yi are not connected, every v G S 1 is in the same 
component as Xi, and every u G \ S 1 is in the same component as y^. (If there is no such 
F, then /i(5) = oo.) 

The main combinatorial property that allows us to solve the problem in polynomial time is that 
the functions /, are submodular. We prove something stronger: the functions /, can be represented 
in a compact way as the cut function of certain directed graphs. 

If D is a directed graph length on the edges and X C V(D), then 5d(X) denotes the total 
length of the edges leaving X. For X, Y C V(D), we denote by \d(A, B) the minimum length of a 
directed cut that separates A from B, i.e., the minimum of Sz>(X), taken over all A C X C V(D)\B 
(if A n £ ^ 0, then A D (A 5) is defined to be oo). 

Definition 23. Let Di be a directed graph with nonnegative lengths. Let Sj and ij be two distin- 
guished vertices and let Ai be a subset of vertices of Di. We say that (Di, Si,U, Ai) represents fi if 
fi(S) = ^D t (S U {si}, (Ai \ S) U {ti}) for every S 1 C ylj. If Sj, £3, Aj are clear from the context, then 
we simply say that Di represents fa. 

A function / defined on the subsets of a ground set U is submodular if 

f(X) + f(Y) > f(X n Y) + /(X U Y) 

holds for every I,Y C JJ. For example, it is well known that 8q(X) is a submodular function on 
the subsets of V(G). Submodularity is a powerful unifying concept of combinatorial optimization: 
classical results on flows, cuts, matchings, and matroids can be considered as consequences of sub- 
modularity. The following (quite standard) proposition shows that if a function can be represented 
in the sense of Definition [23l then the function is submodular. In the proof of Theorem HI we 
show that every function fi can be represented by a directed graph, thus it follows that every /, 
is submodular. Although we do not use this observation directly in the paper, it explains in some 
sense why the problem is polynomial-time solvable. 

Proposition 24. If a function : 2 Ai — > M. + can be represented by (Di, Si, ti, Ai) (in the sense of 
Definition [23l> . then fi is submodular. 

Proof. Let X,Y Q Ai be arbitrary sets. Since Di represents fi, there exist appropriate sets X' and 
Y' with 6 Di (X') = f\(X) and 8 Di (Y') = fa(Y). Now we have 

fi(X) + U(Y) = 5 Di (X') + 5 Di (Y') > 8 Di (X' n Y') + 5 Di (X' U Y'), (11) 

where the inequality follows from the submodularity of <5d;- Observe that (X C\Y) U {s} C X' n Y' 
and X' n Y' C V(A) \ ((A \ X) U (^ \ Y) U {*}). Thus we have f t (X n Y) < <5 A (X' D Y'). In a 
similar way, U Y) < 8r) i (X l U Y'). Together with Inequality (jllh obtained above, this proves 
that/ i (X) + / i (Y)>/ i (XnY) + / i (XUY). □ 

Proof of Theorem [^j We assume that in the given instance of Steiner Forest each vertex appears 
only in at most one pair of T>. To achieve this, if a vertex v appears in k > 1 pairs, then we 
subdivide an arbitrary edge incident to v by k — 1 new vertices such that each of the k — 1 edges 
on the path formed by v and the new vertices has length 0. Replacing vertex v in a pair by any of 
the new vertices does not change the problem. 
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For every i = 1, . . . , m, we compute the values aj, 6j, and a representation A of /j. In the 
optimum solution F for the instance (G m ,T>), vertices x m and y m are either connected or not. 
Thus the cost of the optimum solutions is the minimum of a m and / m (0) (recall that A m = 0). The 
value of / m (0) can be easily determined by computing the minimum cost s-t cut in D m . 

If Gi is a single edge e, then ctj and b{ are trivial to determine: ctj is the length of e and 6, = 0. 
The directed graph A representing /j can be obtained from Gi by renaming Xj to Sj, renaming ?/j 
to ij, and either removing the edge e (if T>i = 0) or replacing e with a directed edge Sjij of length 
oo (if {u, v} G T>i). 

If Gi is not a single edge, then it is constructed from some Gj 1 and Gj 2 either by series or 
parallel connection. Suppose that ctj , bj p , and Dj for p = 1,2 are already known. We show how 
to compute ctj, 6j, and -Dj in this case. 

Parallel connection. Suppose that Gi is obtained from Gj 1 and Gj 2 by parallel connection. 
It is easy to see that ctj = min{aj 1 + 6j 2 , 6^ + aj 2 } and 6j = 6^ + bj 2 . To obtain Dj, we join Dj x and 
A' 2 by identifying Sj ± with Sj 2 (call it Sj) and by identifying tj ± with fj 2 (call it tj). Furthermore, 
for every {«, € A \ {T>j 1 U A 2 }' we add directed edges uv and vit with length oo. 

To see that Di represents /j, suppose that F is the subgraph that realizes the value fi(S) for 
some S C Aj. We have to show that there is an appropriate X C F(A) certifying AD i (5'U{s}, (Aj\ 
S) U {i}) < ^(F). The graph F is the edge disjoint union of two graphs F\ C and F2 C G J2 . For 
p = 1, 2, let 5 P C Aj p be the set of those vertices that are connected to Xj p in F p , it is clear that F p 
connects Aj p \ S p to yj p . Since F p does not connect Xj and yj, we have that i(F p ) > fj p (S p ). Since 
D jp represents f jp , there is a set X p in D jp with S* p U {s ip } C X p C V(A) \ \ -S* p ) U {ij p }), and 
5 D3p (X p ) = (S" p ). We show that fo. (X x U X 2 ) = <5 Dji (X ± ) + <5 Dj2 (X 2 ). Since A is obtained from 
joining Ah and A' 2 ' ^ e oru y thing that has to be verified is that the edges with infinite length 
added after the join cannot leave X\ U X 2 . Suppose that there is such an edge uv, assume without 
loss of generality that u G X± and v G V(Dj 2 ) \ X 2 . This means that u G S 1 and v $ S 2 . Thus F 
connects u with Xj and v with yj, implying that F does not connect u and v. However {u, v} G A> 
a contradiction. Therefore, for the set X := X\ U X2, we have 

6 Di (X) = Sn^Xi) + S Dj2 (X 2 ) = f n (S r ) + f n (S 2 ) < t{F x ) + t(F 2 ) = 1(F) = f t (S), 

proving the existence of the required X. 

Suppose now that for some S C A i: there is a set X with SU-f^} C X C F(A)\((^A*S') U {^})- 
We have to show that ^(X) > /j(S'). For p = 1,2, let l„ = In V(D jp ) and S" p = A ip n X p . 
Since A P represents fj p , we have that <$o.j (-Xp) > fj p (S p ). Let F p be a subgraph of Gj p realizing 
fj p (S p ). Let F = Fi U F 2 ; we show that £(F) > fi(Ai), since F satisfies all the requirements in the 
definition of fi(Ai). It is clear that F does not connect Xj and yj. Consider a pair {«, G A- If 
{u, f} G A P 5 then F connects u and u. Otherwise, let {u, v} G A \ {Ai UP 32 }. Suppose that F 
does not connect u and u, without loss of generality, suppose that u G X\ and w ^ X 2 . This means 
that there is an edge uv of length 00 in A and S^iX) = 00 > /j(S') follows. Thus we proved 
Z{F) > fi(Ai) an< i we nave 

«5 Di (X) = Sn^Xi) + 6 Dj2 (X 2 ) > f n (S l ) + f n {S 2 ) = t{F x ) + t{F 2 ) = 1(F) > f t (S), 

what we had to show. 

Series connection. Suppose that Gi is obtained from Gj l and Gj 2 by series connection and 
let fi := yj 1 = Xj 2 be the middle vertex. It is easy to see that ctj = aj 1 + dj 2 (vertex \x has to be 
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Figure 1: Construction of Di in a series connection. 



connected to both X{ and yi). To compute &j, we argue as follows. Denote by G^ 2 the graph obtained 
from Gj 2 by swapping the names of distinguished vertices Xj 2 and yj 2 . Observe that the graph G\ 
in the definition of 6j arises as the parallel connection of Gj 1 and G^. It is easy to see that a^ 2 , 
bf 2 , and corresponding to Gf 2 can be defined as af 2 = a,j 2 , bf 2 = bj 2 , and f^(S) = fj 2 (Aj 2 \ S). 
Furthermore, if Dj 2 represents fj 2 , then the graph D$ obtained from Dj 2 by swapping the names 
of Sj 2 and tj 2 represents fP. Thus we have everything at our disposal to construct a directed graph 
D[ that represents the function j[ corresponding to the parallel connection of Gj x and G^. Now 
it is easy to see that bi = f-(Ai): graph G\ is isomorphic to the parallel connection of Gj 1 and 
G^ 2 and the definition of b{ requires that Ai is connected to x\ = yi. The value of fl(Ai) can be 
determined by a simple minimum cut computation in D[. 

Let T\ C Aj x contain those vertices v for which there exists a pair {v, u} € T>i with u G Aj 2 and 
let T2 C Aj 2 contain those vertices v for which there exists a pair {v, u} G T>{ with u E ^4,^. Observe 
that Ai = (Aj 1 \ Ti) U (Aj 2 \T2) (here we are using the fact that each vertex is contained in at most 
one pair, thus a v £ T\ cannot be part of any pair {v,u} with u V(Di)). To construct Di, we 
connect Dj 1 and Dj 2 with an edge tj 1 Sj 2 of length and set Sj := and tj := Furthermore, 
we introduce two new vertices 71 , 72 and add the following edges (see Figure [T]) : 

• s j 1 71 with length Oj 2 , 

• tIt^ with length f h {A h \ T X ) + f j2 (T 2 ), 

• 72*72 with length , 

• 7271 with length 00, 

• 71 v with length 00 for every v € V{Dj 1 ), 

• vyi with length 00 for every v € V(-Dj 2 ), 

• U71 with length 00 for every v E T\, and 

• 72V with length cxd for every v E T 2 . 

Suppose that i 7 is the subgraph that realizes the value fi(S) for some S C i;; we have to 
show that Di has an appropriate cut with value fi(S). Subgraph F is the edge-disjoint union of 
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subgraphs F\ C Gj 1 and F2 C Gj 2 . We consider 3 cases: in subgraph F, vertex \i is either connected 
to neither Xj x nor tjj 2 , connected only to Xj ± , or connected only to yj 2 . 

Case 1: fi is connected to neither Xj x nor yj 2 . In this case, vertices of Aj x are not connected 
to yi and vertices of Aj 2 are not connected to Xi, hence 5 = Ai n = \ T\ is the only 
possibility. Furthermore, F connects both T\ and T2 to /i. It follows that 1(F) = £(F L ) + £{F 2 ) > 
/ jl (^ 1 \T , i) + / j2 (T 2 ). SetX = V(D i )U{ 11 }: now we have 5 Di (X) = f jl (A jl \T 1 ) + f j2 (T 2 ) < £(F), 
X contains (Aj 1 \ T\) U {sj}, and is disjoint from (A, 2 \ T 2 ) U {ij}. 

Case 2: /i is connected only to X{. This is only possible if Aj x \ T\ C. S. Clearly, £{F\) > cij 1 . 
Subgraph F2 has to connect every vertex in (SCiAj 2 ){JT2 to Xj 2 and every vertex Ai\S = Aj 2 \(S{JT 2 ) 
to yj 2 . Hence £(F 2 ) > fj 2 ((S H Aj 2 ) U T2) and let X2 C l^(Dj 2 ) be the corresponding cut in Fa^. 
Set X := X 2 U FpjJ U {71, 72}, we have 6 Di (X) = f j2 {(S n A,- 2 ) U T 2 ) + a h < £(F 2 ) + < £{F) 
(note that no edge with infinite length leaves X since T 2 C X 2 ). As X contains S and contains 
none of the vertices in Ai \ S, we proved the existence of the required cut. 

Case 3: /x is connected only to y». Similar to case 2. 

Suppose now that for some S Q Ai, there is a set X C V{Di) with 5 U {s{} C X C F(-Dj) \ 
((Aj \ 5) U {tj}). We have to show that > fi(S). If ^(X) = 00, then there is nothing to 

show. In particular, because of the edge 7271, we are trivially done if 7 2 € X and 71 ^ X. Thus 
we have to consider only 3 cases depending which of 71,72 are contained in X. 

Case 1: 71 G X, 7 2 ^ X. In this case, the edges having length 00 ensure that V(Dj 1 ) C X 
and VCPfc) D X = 0, thus ^(X) = £( : jr^) + l(t h Sj 2 ) = f jl {A jl \ Ti) + fj 2 ( T h)- We also have 
5 = \ Ti. Now it is easy to see that fi(S) < fj 1 (Aj 1 \ T\) + fj 2 (Tj 2 ): taking the union of some 
Fi realizing fj 1 (Aj 1 \ Ti) and some F 2 realizing fj 2 (Tj 2 ), we get a subset F realizing fi(S). 

Case 2: 71,72 £ X. The edges starting from 71 and having length 00 ensure that V{Dj 1 ) C X. 
Furthermore, 72 S X ensures that T 2 C X. Let X 2 := XnV(Dj 2 ), we have X 2 rij4j 2 = T 2 U(5nA j2 ), 
which implies (X 2 ) > fj 2 (T2 U (5 n Aj 2 )). Observe that fe^X) = aj 1 + <5£) J2 (X 2 ) (the term 
comes from the edge 72^). Let Fx be a subset of Gj 1 realizing aj 1 and let F 2 be a subset of Gj 2 
realizing fj 2 (T2Li(S(lAj 2 )). Let F := FiUF 2 , note that F connects vertices S with Xj, vertices Ai\S 
with yj, and vertices in T\ U T 2 with \x. Thus < ^(F) = aj 1 + fj 2 (T 2 U (5 D ^4j 2 )) = $Di(X), 

what we had to show. 

Case 3: 71,72 ^ X. Similar to Case 2. □ 

8 Hardness for treewidth 3 

In this section, we show that Steiner Forest is NP-hard on graphs with treewidth at most 3. Very 
recently, this has been proved independently by Gassner }Gas 09] , but our compact proof perhaps 
better explains what the reason is for the sharp difference between the series-parallel and the 
treewidth 3 cases. 

Consider the graph in Figure [2] and let us define the function / analogously to the function 
fi in Section [7) for every S C {1,2,3}, let f(S) be the minimum cost of a subgraph F where 
x and y are not connected, ti is connected to x for every i 6 S, and ti is connected to y for 
every i € {1,2,3} \ S; if there is no such subgraph F, then let f(S) = 00. It is easy to see that 
/({1,2}) = /({2,3}) = /({1,2,3}), while /({2}) = 00. Thus, unlike in the case of series-parallel 
graphs, this function is not submodular. 

We use the properties of the function / defined in the previous paragraph to obtain a hardness 
proof in a more or less "automatic" way. Let us define the Boolean relation R(a, b, c) := (a = 
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Figure 2: The graph used in the proof Theorem 1261 



c) V (6 = c). Observe that for any S 1 C {1, 2, 3}, we have f(S) = 3 if R(l G 5, 2 G S 1 , 3 G S 1 ) = 1 and 
/(S*) = oo otherwise (here 1 G 5 means the Boolean variable that is 1 if and only if 1 G S). An 
R-formula is a conjunction of clauses, where each clause is the relation R applied to some Boolean 
variables or to the constants and 1, e.g., R(x\, 0, X4) A R(0,X2,x\) A R(x^,X2, 1). In the R-Sat 
problem, the input is an i?-formula and it has to be decided whether the formula has a satisfying 
assignment. 

Lemma 25. R-Sat is NP-complete. 

Proof. Readers familiar with Schaefer's Dichotomy Theorem (more precisely, the version allowing 
constants |Sch781 Lemma 4.1]) can easily see that i?-SAT is NP-complete. It is easy to verify that 
the relation R is neither weakly positive, weakly negative, affine, nor bijunctive. Thus the result of 
Schaefer immediately implies that R-Sat is NP-complete. 

For completeness, we give a simple self-contained proof here. The reduction is from Not-All- 



Equal 3SA which is known to be NP-complete even if there are no negated literals [Sch78] . Given 
a NAE-SAT formula, we replace each clause as follows. For each clause NAE(a, b, c), we introduce 
a new variable d and create the clauses R(a, b, d) A R(c, d, 0) A R(c, d,l). If a = b = c, then it is 
not possible that all three clauses are simultaneously satisfied (observe that the second and third 
clauses force d). On the other hand, if a, b, c do not have the same value, then all three clauses 
can be satisfied by an appropriate choice of d. Thus the transformation from NAE-SAT to R-Sat 
preserves satisfiability. □ 

The main idea of the following proof is that we can simulate arbitrarily many i?-relations by 
joining in parallel copies of the graph shown in Figure 

Theorem 26. Steiner Forest is NP-hard on planar graphs with treewidth at most 3. 

Proof. The proof is by reduction from R-Sat. Let <ft be an i?-formula having n variables and m 
clauses. We start the construction of the graph G by introducing two vertices vq and v\. For each 
variable Xj of <f>, we introduce a vertex Xi and connect it to both vq and v\. We introduce 3 new 
vertices a^, &j, q corresponding to the i-th clause. Vertices en and 6, are connected to both and 
v%, while Q is adjacent only to en and 6j. If the i-th clause is R(xi 1 ,Xi 2 ,X{ 3 ), then we add the 3 
pairs {xj^Oj}, {xi 2 ,bi}, {xj 3 ,Cj} to T>. If the clause contains constants, then we use the vertices 

4 In Not-All-Equal 3SAT, or NAE-3SAT for short, we are given a 3SAT instance with the extra restriction that a 
clause is not satisfied if all the literals in a clause are true. Similarly to 3 SAT, the clause is not satisfied if all the 
literals are false, either. Thus, the literals in each clause have to take both true and false values. 
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vq and v\ instead of the vertices Xj i; Xj 2 , Xi 3 , e.g., the clause R(0, Xj 2 , 1) yields the pairs {t>o,aj}, 
{xi 2 , bi}, {v\,Ci}. The length of every edge is 1. This completes the description of the graph G and 
the set of pairs V. 

We claim that the constructed instance of Steiner Forest has a solution with n + 3m edges if and 
only if the i?-formula (f> is satisfiable. Suppose that <j> has a satisfying assignment /. We construct 
the F as follows. If /(xj) = 1, then let us add edge X{V± to F; if /(xj) = 0, then let us add edge 
XiVo to F. For each clause, we add 3 edges to F. Suppose that the z-th clause is i?(xi 15 Xi 2 , Xj 3 ). 
We add one of diVo or aiv\ to F depending on the value of /(x^) and we add one of biVo or biV\ 
to F depending on the value of f{xi 2 ). Since the clause is satisfied, either f(xi s ) = /(xjj or 
f(xi 3 ) = /(xj 2 ); we add QOj or Cj6j to F, respectively (if /(xj 3 ) is equal to both, then the choice 
is arbitrary). We proceed in an analogous manner for clauses containing constants. It is easy to 
verify that each pair is in the same connected component of F. 

Suppose now that there is a solution F with cost n + 3m. At least one edge is incident with 
each vertex x,, since it cannot be isolated in F. Each vertex a^, bi, C{ has to be connected to either 
vq or vi, hence at least 3 edges of F are incident with these 3 vertices. As F has n + 3m edges, it 
follows that exactly one edge is incident with each Xj, and hence exactly 3 edges are incident with 
the set {ai,bi,Ci}. It follows that vq and v± are not connected in F. Define an assignment of (f> by 
setting /(xj) = if and only if vertex Xj is in the same component of F as v$. To verify that a 
clause i?(xi 15 Xj 2 , Xj 3 ) is satisfied, observe that q is in the same component of F as either or 6j. 
If Cj is in the same component as, say, aj, then this component also contains Xj 3 and x^, implying 
/(x i3 ) = f(x h ) as required. □ 
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